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1 1 ems Descr i pt i on 
111912 (SPEECH OR LANGUAGE OR VOICE OR AUDITORY)(2N)(RECOGNITION OR
RECOGNIZ? OR RECOGNIS? OR PROCESS? OR INPUT? OR OUTPUT? OR
SYNTHES?)

GRAMMAR OR VOCABULARY OR DICTIONAR? OR LEXICON?
ENTRY OR ENTRIES OR WORD?? OR UTTERANCE??
FILE OR FILES OR FOLDER?? OR PROGRAM?? OR PROGRAMME OR
APPLICATION?? OR EXECUTABLE

USER OR USERS OR CLIENT?? OR SUBSCRIBER??
S5(4N)(SPEAK??? OR ENUNCIAT??? OR VOCALIZ??? OR SAYS OR
STAT??? OR UTTER???)

S4(5N)(OPEN??? OR RUN OR RUNS OR EXECUT??? OR START??? OR
LAUNCH??? OR ACCESS??? OR ACTIVAT???)

(VOICE OR LANGUAGE)(2N)(CONTROL? OR ACTIVAT? OR OPERAT? OR
COMMAND?)

OPERATING()SYSTEM?

AU=(MAJOR, A? OR MAJOR A? OR WANDINGER, M? OR WANDINGER M?)
S10 AND S1
S2(30N)S3

S12 AND S1
S6 AND S3

S13 AND S14

S15 AND (S7 OR S8)

S16 AND IC=G10L?
S13 AND S9
S18 AND IC=G10L?
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0016699175 - Drawing available 
WPI Acc No: 2007-414255/200740
XRPX Acc No: N2007-311092

Adaptive speech recognition method for controlling e.g. mobile phone,
involves transmitting user-dependent adjustment parameters for speech
recognition to input device, and carrying out speech recognition of
user based on parameters

Patent Assignee: SIEMENS AG (SIEI)
Inventor: GUITARTE P J F; LUKAS K;
Patent Family (1 patents, 1 countries)
Patent                             Application
Number           Kind  Date      Number           Kind  Date      Update
DE 102005041618  A1    20070315  DE 102005041618  A     20050901  200740 B

Priority Applications (no., kind, date): DE 102005041618 A 20050901

Patent Details
Number           Kind  Lan  Pg  Dwg  Filing Notes
DE 102005041618  A1    DE   13  5

Adaptive speech recognition method for controlling e.g. mobile phone,
involves transmitting user-dependent adjustment parameters for speech
recognition to input device, and carrying out speech recognition of
user based on parameters
... Inventor: WANDINGER M

Alerting Abstract ...NOVELTY - The method involves transmitting
user-dependent adjustment parameters for a speech recognition from a
central server (4) to input devices (2, 6) via a communication channel (5).
The speech recognition of a user (8) is carried out in the input
devices based on the user...

...processing by the server unit to determine the user-dependent adjustment
parameters corresponding to a speech recognition model. DESCRIPTION - An
INDEPENDENT CLAIM is also included for a system for adaptive speech
recognition, comprising an input device...

...USE - Used for an adaptive speech recognition of a user to control
and provide instruction input for devices such as mobile phone...

...ADVANTAGE - The method provides speech recognition adapted to a user
in different devices, without requiring a user to carry out an individual
adjustment with each device, thus reducing the expenditure for adaptation
of speaker-dependent speech recognition.

...DESCRIPTION OF DRAWINGS - The drawing shows a schematic view of a
speech recognition system

Original Publication Data by Authority

Inventor name & address:
... Wandinger, Michael, 81829 München, DE



DIALOG(R) File 350: Derwent WPIX
(c) 2008 The Thomson Corporation. All rts. reserv.

0015314455 - Drawing available 
WPI Acc No: 2005-664663/200568
XRPX Acc No: N2005-544380
Voice recognition for portable instruments, e.g. telephones and the
like, establishes a confidence measure and threshold for speech dependent
and speech adaptive systems
Patent Assignee: SIEMENS AG (SIEI); SCHROER A (SCHR-I); STRANART T
(STRA-I); WANDINGER M (WAND-I)
Inventor: SCHNEIDER T; SCHROEER A; STRANART T; WANDINGER M; SCHROER A
Patent Family (3 patents, 107 countries)
Patent                            Application
Number          Kind  Date      Number          Kind  Date      Update
WO 2005088607   A1    20050922  WO 2005EP50342  A     20050127  200568 B
EP 1723636      A1    20061122  EP 2005707860   A     20050127  200677 E
                                WO 2005EP50342  A     20050127
US 20070213978  A1    20070913  WO 2005EP50342  A     20050127
                                US 2006592526   A     20060912

Priority Applications (no., kind, date): DE 102004012206 A 20040312
Patent Details

Number          Kind  Lan  Pg  Dwg  Filing Notes
WO 2005088607   A1    DE   21  3

National Designated States, Original: AE AG AL AM AT AU AZ BA BB BG BR BW
BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR
HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW
MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN
TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW
Regional Designated States, Original: AT BE BG BW CH CY CZ DE DK EA EE ES
FI FR GB GH GM GR HU IE IS IT KE LS LT LU MC MW MZ NA NL OA PL PT RO SD
SE SI SK SL SZ TR TZ UG ZM ZW
EP 1723636      A1    DE            PCT Application WO 2005EP50342
                                    Based on OPI patent WO 2005088607
Regional Designated States, Original: DE FR GB IT

US 20070213978  A1    EN            PCT Application WO 2005EP50342

Voice recognition for portable instruments, e.g. telephones and the
like, establishes a confidence measure and threshold...
... Inventor: WANDINGER M

Alerting Abstract ...NOVELTY - During voice recognition, at a
portable instrument, a nominal recognition can be derived or be given. The
voice is recognized, and a confidence measure is determined for the
nominal recognition, giving a defined confidence threshold...

USE - The voice recognition system is for use with a variety of
portable instruments e.g. mobile telephones, personal...

...DESCRIPTION OF DRAWINGS - The drawing shows a flow chart of the voice
recognition stages in a speech adaptive training program with matching
confidence thresholds. (The drawing includes non...

Original Publication Data by Authority

Inventor name & address:
...WANDINGER, Michael...

...Wandinger, Michael...

...WANDINGER, Michael
Original Abstracts:

During a voice recognition, a theoretic identification result is
deducible or already given. The recognition process is carried out...

...During a voice recognition, a theoretic identification result is
deducible or already given. The recognition process is carried out...

...During a voice recognition, a theoretic identification result is
deducible or already given. The recognition process is carried out...
Claims:

<b>1</b>-<b>9</b>. (canceled) <b>10</b>. A method for voice recognition,
comprising: deducing and/or specifying a theoretic identification result
for a recognition process of a voice recognition system receiving a
spoken theoretic identification result from a user; and performing the
recognition process...



DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv. 

0014393762 - Drawing available 
WPI Acc No: 2004-583104/200457
XRPX Acc No: N2004-460869

Speech recognition method for a mobile terminal, e.g. PDA, MP3-player,
mobile phone, in which background noise profiles are stored and selected
for use in conjunction with the speech recognition process
Patent Assignee: SIEMENS AG (SIEI)
Inventor: MAJOR A R; SCHROEER A
Patent Family (3 patents, 31 countries)
Patent                         Application
Number       Kind  Date      Number         Kind  Date      Update
EP 1445759   A1    20040811  EP 2004100134  A     20040116  200457 B
DE 10305369  A1    20041104  DE 10305369    A     20030210  200472 E
DE 10305369  B4    20050519  DE 10305369    A     20030210  200534 E

Priority Applications (no., kind, date): DE 10305369 A 20030210

Patent Details

Number       Kind  Lan  Pg  Dwg  Filing Notes
EP 1445759   A1    DE   9   3

Regional Designated States, Original: AL AT BE BG CH CY CZ DE DK EE ES FI
FR GB GR HU IE IT LI LT LU LV MC MK NL PT RO SE SI SK TR

Speech recognition method for a mobile terminal, e.g. PDA, MP3-player,
mobile phone, in which background noise profiles are stored and selected
for use in conjunction with the speech recognition process

Original Titles:

...User adaptive method for modeling of background noise in speech
recognition

Inventor: MAJOR A R...

Alerting Abstract ...NOVELTY - Speech recognition method in which a
number of background noise profiles are stored and provided and one is
selected to be used during a speech recognition process. ...
Background noise profiles can be stored by users and used to improve and
train their speech recognition applications...

...USE - Speech recognition method for a mobile terminal, e.g. PDA,
MP3-player, mobile phone...

...ADVANTAGE - Speech recognition is improved...
Original Publication Data by Authority



Inventor name & address:
Major, Andreas Ralph, 81677 München, DE.

Major, Andreas Ralph, 81677 München, DE.
Major, Andreas Ralph

11/3,K/4 (Item 4 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv. 

0014112337 - Drawing available 
WPI Acc No: 2004-296669/200428
XRPX Acc No: N2004-235662

Voice-based text input for pre-installed applications of mobile devices
e.g. mobile phones, involves voice signal input with conversion by
voice recognition device into textural units
Patent Assignee: SIEMENS AG (SIEI)
Inventor: HARENGEL S; MAJOR A R
Patent Family (2 patents, 106 countries)
Patent                           Application
Number         Kind  Date      Number          Kind  Date      Update
DE 10337822    A1    20040325  DE 10337822     A     20030818  200428 B
WO 2005020211  A1    20050303  WO 2004EP51753  A     20040809  200519 E

Priority Applications (no., kind, date): DE 10337822 A 20030818



10501857 Language Recognizer And Operating Method Therefor



Patent Details

Number         Kind  Lan  Pg  Dwg  Filing Notes
DE 10337822    A1    DE   5   1
WO 2005020211  A1    DE

National Designated States, Original: AE AG AL AM AT AU AZ BA BB BG BR BW
BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR
HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW
MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR
TT TZ UA UG US UZ VC VN YU ZA ZM ZW

Regional Designated States, Original: AT BE BG BW CH CY CZ DE DK EA EE ES
FI FR GB GH GM GR HU IE IT KE LS LU MC MW MZ NA NL OA PL PT RO SD SE SI
SK SL SZ TR TZ UG ZM ZW

Voice-based text input for pre-installed applications of mobile devices
e.g. mobile phones, involves voice signal input with conversion by
voice recognition device into textural units

Original Titles:

...VOICE-ASSISTED TEXT INPUT FOR PRE-INSTALLED APPLICATIONS IN MOBILE
DEVICES

... Inventor: MAJOR A R

Alerting Abstract ...NOVELTY - A procedure for voice recognition for
an application taking place sequentially on an operation system and in
which the application is ready for receiving textural units from the
operating system and in which a voice signal is input and is converted
by a voice recognizer into textural units, which are then handed over
to the operation system from which the...

...DESCRIPTION OF DRAWINGS - A procedure for voice recognition via a
flow-diagram is represented. (Contains non-English language text...

Original Publication Data by Authority

Inventor name & address:

... Major, Andreas Ralph, 81677 München, DE...

... MAJOR, Andreas Ralph
Original Abstracts:

...According to the invention, an interposed voice-recognition
application allows letters or commands to be input into any applications
running on an operating system. To achieve this, the operating system
transfers said letters or commands to the...



DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv. 

0014022886 - Drawing available 
WPI Acc No: 2004-204650/200420
XRPX Acc No: N2004-162597
Voice recognition involves searching for/recognizing successive
instruction words using transition probabilities, recognizing fill words,
carrying out command if instruction word fully recognized
Patent Assignee: SIEMENS AG (SIEI)

Patent Family (4 patents, 30 countries)
Patent                       Application
Number       Kind  Date      Number        Kind  Date      Update
EP 1391877   A1    20040225  EP 200218721  A     20020821  200420
EP 1391877   B1    20071010  EP 200218721  A     20020821  200766
DE 50211050  G     20071122  DE 50211050   A     20020821  200777
                             EP 200218721  A     20020821
ES 2291403   T3    20080301  EP 200218721  A     20020821  200821 E

Priority Applications (no., kind, date): EP 200218721 A 20020821
Patent Details

Number       Kind  Lan  Pg  Dwg  Filing Notes
EP 1391877   A1    DE   11  3

Regional Designated States, Original: AL AT BE BG CH CY CZ DE DK EE ES FI
FR GB GR IE IT LI LT LU LV MC MK NL PT RO SE SI SK TR
EP 1391877   B1    DE

Regional Designated States, Original: DE ES FR GB

DE 50211050  G     DE            Application EP 200218721
                                 Based on OPI patent EP 1391877
ES 2291403   T3    ES            Application EP 200218721
                                 Based on OPI patent EP 1391877

Voice recognition involves searching for/recognizing successive
instruction words using transition probabilities, recognizing fill words,
carrying out...

Original Titles:

... Method for speech recognition

... Method for speech recognition
Inventor: WANDINGER M

Alerting Abstract ...defined sequence of instruction words (BF1,..) and
fill words (FW) between instruction words to a voice recognition
device, searching for and recognizing successive instruction words using
associated transition probabilities (W), recognizing fill...
USE - For voice recognition.

...ADVANTAGE - Developed for natural speech input with increased voice
recognition performance compared to conventional methods...

...DESCRIPTION OF DRAWINGS - The drawing shows a schematic representation
of an inventive voice recognition method (Drawing includes non-English
text

Original Publication Data by Authority

Inventor name & address:
Wandinger, Michael...

...Wandinger, Michael
Claims:

... Method for speech recognition - with which a spoken sentence of a
user, which comprises a command word sequence made...

... as a number of filler words arranged between the command words, is fed
to a speech recogniser, - with which with the aid of the speech
recogniser, consecutive command words are sought and recognised on the
basis of assigned transition probabilities, - with which filler words are
recognised with the aid of the speech recogniser and - with which a
correspondingly assigned command is executed in the case of a completely...

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv. 
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0014022716 

WPI Acc No: 2004-204477/200420
XRPX Acc No: N2004-162475

Speech recognition method e.g. for mobile telephone, identifies which
spoken variants of same word can be recognized with analysis of recognition
difficulty for limiting number of acceptable variants

Patent Assignee: SIEMENS AG (SIEI); STEINMASSL B (STEI-I); STEINMASSL K
(STEI-I); SCHNEIDER T (SCHN-I); SCHROER A (SCHR-I); STEINMABL B
(STEI-I); STEINMABL G M (STEI-I); STEINMABL K (STEI-I); WANDINGER M
(WAND-I)

Inventor: SCHNEIDER T; SCHROEER A; STEINMASSL G; STEINMASSL G D; WANDINGER
M; SCHROER A; STEINMABL B; STEINMABL G M; STEINMABL K

Patent Family (4 patents, 107 countries)
Patent                            Application
Number          Kind  Date      Number         Kind  Date      Update
DE 10304460     B3    20040311  DE 10304460    A     20030204  200420
WO 2004070702   A1    20040819  WO 2004EP527   A     20040122  200455
EP 1590795      A1    20051102  EP 2004704214  A     20040122  200573
                                WO 2004EP527   A     20040122
US 20060143008  A1    20060629  WO 2004EP527   A     20040122
                                US 2005544596  A     20050804

Priority Applications (no., kind, date): DE 10304460 A 20030204
Patent Details

Number          Kind  Lan  Pg  Dwg  Filing Notes
DE 10304460     B3    DE   6   0
WO 2004070702   A1    DE

National Designated States, Original: AE AG AL AM AT AU AZ BA BB BG BR BW
BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR
HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW
MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR
TT TZ UA UG US UZ VC VN YU ZA ZM ZW

Regional Designated States, Original: AT BE BG BW CH CY CZ DE DK EA EE ES
FI FR GB GH GM GR HU IE IT KE LS LU MC MW MZ NL OA PT RO SD SE SI SK SL
SZ TR TZ UG ZM ZW

EP 1590795      A1    DE            PCT Application WO 2004EP527
                                    Based on OPI patent WO 2004070702

Regional Designated States, Original: AL AT BE BG CH CY CZ DE DK EE ES FI
FR GB GR HU IE IT LI LT LU LV MC MK NL PT RO SE SI SK TR

US 20060143008  A1    EN            PCT Application WO 2004EP527

Speech recognition method e.g. for mobile telephone, identifies which
spoken variants of same word can be...

Original Titles:

...GENERATION AND DELETION OF PRONUNCIATION VARIATIONS IN ORDER TO REDUCE
THE WORD ERROR RATE IN SPEECH RECOGNITION

...Generation and deletion of pronunciation variations in order to reduce
the word error rate in speech recognition

...GENERATION AND DELETION OF PRONUNCIATION VARIATIONS IN ORDER TO REDUCE
THE WORD ERROR RATE IN SPEECH RECOGNITION
... Inventor: WANDINGER M

Alerting Abstract ...NOVELTY - The speech recognition method has
several variants for a spoken word generated, with registration during a
speech recognition process of which spoken variants of the word can be
identified and analysis of the recognition...

...a speech recognition device; a computer program product for a
speech recognition method
...USE - The speech recognition method can be used for a mobile
telephone.

Original Publication Data by Authority

Inventor name & address:

... Wandinger, Michael, 81829 München, DE...

...WANDINGER, Michael...

...Wandinger, Michael...

...WANDINGER, Michael
Original Abstracts:

...Disclosed is a speech recognition method which is based on a dynamic
extension of the word models in combination with an evaluation of the
pronunciation variations...

...Disclosed is a speech recognition method which is based on a dynamic
extension of the word models in combination with an evaluation of the
pronunciation variations...

...Disclosed is a speech recognition method which is based on a dynamic
extension of the word models in combination with an evaluation of the
pronunciation variations.



DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv. 
0013747646 

WPI Acc No: 2003-846303/200379
XRPX Acc No: N2003-676371

Determination of short text messages using background data held in message
templates to simplify the generation of messages

Patent Assignee: SIEMENS AG (SIEI)

Inventor: KUNSTMANN N; STEINMASSL G; WANDINGER M

Patent Family (1 patents, 1 countries)

Patent                         Application
Number       Kind  Date      Number       Kind  Date      Update
DE 10238292  A1    20031106  DE 10238292  A     20020821  200379 B

Priority Applications (no., kind, date): DE 10238292 A 20020821

Patent Details

Number       Kind  Lan  Pg  Dwg  Filing Notes
DE 10238292  A1    DE   6   0

... Inventor: WANDINGER M

...data. This is used in message templates and input is spoken and
interpreted by a speech recognition unit.

Original Publication Data by Authority

Inventor name & address:

... Wandinger, Michael, 81829 München, DE






11/3,K/8 (Item 8 from file: 350) (Note: Current app)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv. 
0013505945 

WPI Acc No: 2003-598627/200356
XRPX Acc No: N2003-476927

Automatic speech recognizer e.g. for mobile telephone, uses stored
vocabulary containing names used for voice control of ... files

Patent Assignee: MAJOR A R (MAJO-I); SIEMENS AG (SIEI); WANDINGER M
(WAND-I)

Inventor: MAJOR A R; WANDINGER M
Patent Family (6 patents, 32 countries)
Patent Details

Number          Kind  Lan  Pg  Dwg  Filing Notes
WO 2003060879   A1    DE   15  0
National Designated States, Original: US
Regional Designated States, Original: AT BE BG CH CY CZ DE DK EE ES FI FR
GB GR HU IE IT LU MC NL PT SE SI SK TR
EP 1466318      A1    DE            PCT Application WO 2003EP5
                                    Based on OPI patent WO 2003060879
Regional Designated States, Original: AL AT BE BG CH CY CZ DE DK EE ES FI
FR GB GR HU IE IT LI LT LU LV MC MK NL PT RO SE SI SK TR
US 20050021342  A1    EN            PCT Application WO 2003EP5
EP 1466318      B1    DE            PCT Application WO 2003EP5
                                    Based on OPI patent WO 2003060879
Regional Designated States, Original: DE ES FR GB IT
DE 50304848     G     DE            Application EP 2003729416
                                    PCT Application WO 2003EP5
                                    Based on OPI patent EP 1466318
                                    Based on OPI patent WO 2003060879
ES 2268366      T3    ES            Application EP 2003729416
                                    Based on OPI patent EP 1466318

Automatic speech recognizer e.g. for mobile telephone, uses stored
vocabulary containing names used for voice control of...

Original Titles:

...LANGUAGE RECOGNIZER AND OPERATING METHOD THEREFOR...

...LANGUAGE RECOGNIZER AND OPERATING METHOD THEREFOR...

... Language recognizer and operating method therefor...

...LANGUAGE RECOGNIZER AND OPERATING METHOD THEREFOR
Inventor: MAJOR A R...
...WANDINGER M

Alerting Abstract ...NOVELTY - The speech recognizer has a stored
vocabulary of recognizable words in the form of names, used for voice...

...data file, the names of the links providing a first active partial
vocabulary for the speech recognizer. DESCRIPTION - An INDEPENDENT
CLAIM for an operating method for an automatic speech recognizer is
also included...

...USE - The automatic speech recognizer is used for voice control of
programs and/or data files, e.g. for a...

Original Publication Data by Authority

Inventor name & address:
MAJOR, Andreas Ralph...

...WANDINGER, Michael...

... MAJOR, Andreas Ralph...

...WANDINGER, Michael...

... Major, Andreas Ralph...

...Wandinger, Michael...

... MAJOR, Andreas Ralph...

...WANDINGER, Michael
Original Abstracts:

Disclosed is a language recognizer with a stored vocabulary of words to
be recognized for the language-based controlling of a plurality of
programs and/or other files. A word of said...

...or file, the names of said links forming a first active partial
vocabulary of the language recognizer.

...Disclosed is a language recognition apparatus having a storage with
a stored vocabulary of words to be recognized for the language-based
controlling of programs and/or other files. A word of the vocabulary is
assigned...

...file, and the names of the links form a first active partial vocabulary
of the language recognition apparatus. Also disclosed is methods of
operating the language recognition apparatus including providing a
language recognition apparatus, for example, as described above, and
generating a current vocabulary containing at least the names of the links
from the file directory when a voice recognition program configured to
perform voice recognition is started...

...Disclosed is a language recognizer with a stored vocabulary of words
to be recognized for the language-based controlling of a plurality of
programs and/or other files. A word of said...

...or file, the names of said links forming a first active partial
vocabulary of the language recognizer.

Claims:
... Voice recogniser having a stored vocabulary of words that are to be
recognised for voice control of a plurality of programs and/or other
files, to each of which is...

...file directory, the names of the links forming a first active partial
vocabulary of the voice recogniser, <b>characterised in that</b> the
file directory contains a plurality of sub-directories on at least...

...links forming a first and, where applicable, further hierarchically
subordinate active partial vocabularies of the voice recogniser and
each program or file being assigned from a sub-directory a voice command
composed...

...<b>1</b>. A voice recognizer having a stored vocabulary of words to
be recognized for voice control of a plurality of programs and/or other
files, each of which are assigned...

...file directory, the names of the links forming a first active partial
vocabulary of the voice recognizer.
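
Illustrative note (not part of the search record): the abstracts above describe a recognizer whose active partial vocabulary is formed from the names of links in a file directory, each link pointing at a program or file. A minimal Python sketch of that idea follows. The function names, the example link paths, and the use of lower-cased link stems as spoken names are all assumptions made for illustration; the record does not specify any of them.

```python
from pathlib import PurePath

def active_vocabulary(links):
    """Build a vocabulary from (link_path, target) pairs.

    The spoken name is assumed to be the link's file name without its
    extension, lower-cased. This naming rule is hypothetical.
    """
    vocab = {}
    for link_path, target in links:
        spoken_name = PurePath(link_path).stem.lower()
        vocab[spoken_name] = target
    return vocab

def resolve(utterance, vocab):
    """Return the target assigned to a recognized name, or None if the
    name is not in the active vocabulary."""
    return vocab.get(utterance.strip().lower())
```

Under these assumptions, a recognized word only ever maps to an entry that exists as a link, which is one way to keep the active vocabulary small.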



DIALOG(R) File 350: Derwent WPIX
(c) 2008 The Thomson Corporation. All rts. reserv.

0013468888 - Drawing available
WPI Acc No: 2003-560595/200353
XRPX Acc No: N2003-445609

Method for voice recognition, especially of names, for commercial and
private uses, involves phoneme-voice-conversion with acoustic output of
obtained phoneme sequence

Patent Assignee: SIEMENS AG (SIEI)

Inventor: KUNSTMANN N; SCHROEER A; STEINMASSL G; WANDINGER M
Patent Family (1 patents, 26 countries)
Patent                        Application
Number      Kind  Date      Number         Kind  Date      Update
EP 1302928  A1    20030416  EP 2001124717  A     20011016  200353 B

Priority Applications (no., kind, date): EP 2001124717 A 20011016

Patent Details

Number      Kind  Lan  Pg  Dwg  Filing Notes
EP 1302928  A1    DE   13  4

Regional Designated States, Original: AL AT BE CH CY DE DK ES FI FR GB GR
IE IT LI LT LU LV MC MK NL PT RO SE SI TR
Method for voice recognition, especially of names, for commercial and
private uses, involves phoneme-voice-conversion with acoustic output...

Original Titles:

... Method for speech recognition, particularly of names, and speech
recognizer

... Inventor: WANDINGER M

Alerting Abstract ...NOVELTY - A voice recognition method, especially
for names, has an input phase for establishing a voice recognition
vocabulary and includes the steps (a) of text-phoneme conversion of a word
inputted as...

...names, for gaining at least one phoneme sequence provisionally assigned
to the word in the voice recognition vocabulary, followed by (b)
phoneme-voice-conversion and acoustic output of the obtained phoneme
sequence...

...c) evaluated by assuming the provisional phoneme sequence is a valid
phoneme sequence in the voice recognition vocabulary and then either
returning to step (a) for getting another provisional phoneme sequence or

DESCRIPTION - An INDEPENDENT CLAIM is given for a voice-recognition
device for speaker-independent voice recognition.

...ADVANTAGE - High level of voice recognition accuracy is achieved
with relatively low implementation outlay

Original Publication Data by Authority

Inventor name & address:
...Wandinger, Michael



17/3,K/1 (Item 1 from file: 347)

DIALOG(R) File 347: JAPIO

(c) 2008 JPO & JAPIO. All rts. reserv.

07973391 **Image available**
VOICE CONTROL SYSTEM

PUB. NO.: 2004-086150 [JP 2004086150 A]

PUBLISHED: March 18, 2004 (20040318)
INVENTOR(s): TAKAMI MASAYUKI
NADA TORU
APPLICANT(s): DENSO CORP

APPL. NO.: 2003-109283 [JP 2003109283]
FILED: April 14, 2003 (20030414)

PRIORITY: 2002-189481 [JP 2002189481], JP (Japan), June 28, 2002
(20020628)

VOICE CONTROL SYSTEM

INTL CLASS: G10L-015/28; G10L-015/00; G10L-015/06; G10L-015/10;
H04Q-009/00

ABSTRACT

PROBLEM TO BE SOLVED: To provide a voice control system for properly
recognizing a speech uttered by a user and controlling a target
device by recognizing the user's speech considering the operating state of
the target device.

SOLUTION: A speech recognition dictionary switching unit 4b selects a
speech recognition dictionary including only the commands selectable in
current operating states of the control target device 2 detected by a
device-state detecting unit 4a. When a speech recognition engine 3
recognizes a user's utterance, it recognizes only the commands included
in the speech recognition dictionary and does not recognize other
commands. Thus, such commands that do not coincide with the...
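
Illustrative note (not part of the search record): the abstract above describes selecting a recognition dictionary that contains only the commands valid in the device's current operating state, and rejecting everything else. A minimal Python sketch of that selection follows. The state names, the command sets, and the function name are hypothetical examples; the record gives none of them.

```python
# Hypothetical per-state dictionaries for an audio player used as the
# control target device; the real states and commands are not given.
DICTIONARIES = {
    "stopped": {"play", "eject"},
    "playing": {"pause", "stop", "next track"},
}

def recognize(utterance, device_state):
    """Return the command if it is selectable in the detected state,
    otherwise None (commands outside the dictionary are not recognized)."""
    allowed = DICTIONARIES.get(device_state, set())
    command = utterance.strip().lower()
    return command if command in allowed else None
```

In this sketch, restricting the candidate set before matching is what prevents a command that makes no sense in the current state from being accepted.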



17/3,K/2 (Item 2 from file: 347)

DIALOG(R) File 347: JAPIO

(c) 2008 JPO & JAPIO. All rts. reserv.

07722278 **Image available**
SPEECH RECOGNITION SYSTEM

PUB. NO.: 2003-216179 [JP 2003216179 A]

PUBLISHED: July 30, 2003 (20030730)
INVENTOR(s): ARAKI NOBUMASA

APPLICANT(s): NEC CORP

APPL. NO.: 2002-015705 [JP 200215705]

FILED: January 24, 2002 (20020124)

SPEECH RECOGNITION SYSTEM

INTL CLASS: G10L-015/18

ABSTRACT

PROBLEM TO BE SOLVED: To realize a system which can correctly recognize
a command language even when the command language is detected as
another speech by uttering one command language at intervals.

SOLUTION: The speech recognition system is provided with a means 5 for
changing a recognition dictionary by decomposing the command language
in a recognition dictionary 4 into prescribed units such as units of
word and syllable and a holding means 6 for holding recognition results
recognized by a speech recognition means 3. The recognition means 3
recognizes the speech uttered by a user based on the recognition
dictionary changed by the changing means 5 and judges whether or not a
plurality of combinations of recognition results becomes the command
language registered in the recognition dictionary before change when a
plurality of the recognition results are held in the holding means 6. Thus,
the command language can be correctly recognized even in the case that
the command language is uttered by being divided into a plurality of
speeches such as the case that the user utters the command language
at intervals.
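
Illustrative note (not part of the search record): the abstract above describes decomposing registered commands into smaller units, holding the units recognized so far, and judging whether the held results combine into a registered command. A minimal Python sketch of that check follows. Splitting on whitespace into word units, and the function names, are assumptions for illustration; the record also mentions syllable units, which this sketch does not model.

```python
def decompose(command):
    """Split a registered command into word units (assumed unit size)."""
    return command.lower().split()

def match_held_results(held_results, registered_commands):
    """Return the registered command whose units equal the held
    recognition results in order, or None if no command matches."""
    units = [result.lower() for result in held_results]
    for command in registered_commands:
        if decompose(command) == units:
            return command
    return None
```

Under these assumptions, a user who says "open" and then, after a pause, "mail" still triggers the single registered command "open mail".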

COPYRIGHT: (C)2003,JPO 



17/3,K/3 (Item 3 from file: 347)

DIALOG(R) File 347: JAPIO

(c) 2008 JPO & JAPIO. All rts. reserv.

07103539 **Image available**
VOICE RESPONDING DEVICE

PUB. NO.: 2001-331196 [JP 2001331196 A]
PUBLISHED: November 30, 2001 (20011130)
INVENTOR(s): IWATA KAZUHIKO
APPLICANT(s): NEC CORP
APPL. NO.: 2000-150035 [JP 2000150035]
FILED: May 22, 2000 (20000522)

INTL CLASS: G10L-015/28; G10L-015/20; G10L-015/00; G10L-015/22

ABSTRACT 

... BE SOLVED: To discriminate a user as an 'inexperienced' user when
hesitation and an unnecessary word exist in a response, even though the
reaction time of the response is the same as that of an experienced user
who is skilled in operation.

SOLUTION: A voice recognition section 1, which receives voice uttered
by a user, recognizes the order of uttering of the words and the
phrases that are registered beforehand in a voice recognition
dictionary section 2. An unnecessary word detecting section 3 checks
whether an unnecessary word is included in the recognition result of
section 1. When an unnecessary word is included, the positional
relationship between the unnecessary word and an objective word in the
recognition result is checked. A skill estimating section 4 estimates
the user's skill in operating the voice responding device
based on the checked result obtained by section 3...



17/3,K/4 (Item 4 from file: 347)

DIALOG(R) File 347: JAPIO

(c) 2008 JPO & JAPIO. All rts. reserv.

06848273 **Image available**

DEVICE AND METHOD FOR INFORMATION PROCESSING AND PROGRAM STORAGE MEDIUM

PUB. NO.: 2001-075773 [JP 2001075773 A]
PUBLISHED: March 23, 2001 (20010323)
INVENTOR(s): KOJIMA KIYONOBU
             KATO YASUHIKO
             YONEKURA SHUJI
             FUJIMURA SATOSHI
             SASAI TAKASHI
             FUJISAWA NAOKI
             O JUNJI
APPLICANT(s): SONY CORP
APPL. NO.: 11-250675 [JP 99250675]
FILED: September 03, 1999 (19990903)

INTL CLASS: G06F-003/16; G10L-015/00

ABSTRACT 

PROBLEM TO BE SOLVED: To easily operate a program by recognizing a voice
and generating a word corresponding to the voice, storing a command
of a specific program corresponding to the word, and sending the command
corresponding to the generated word to the program.

SOLUTION: Data corresponding to a voice of a user are inputted and the data
in specific form such as a text corresponding to a word that the user
speaks are generated and supplied to a voice commander 102. A voice
recognition engine 101 receives the recognized word or the data of
grammar or the like from the voice commander 102 and stores them in a
reading KANA (Japanese syllabary) dictionary database 111 or a recognized
word and grammar database 112 for the engine. When the voice
commander 102 is supplied with the data of the word or the like
corresponding to specific speech that the user speaks from the voice
recognition engine 101, it sends a specific command to a still picture
photography program 103, a still...
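
The word-to-command dispatch that this abstract attributes to the voice commander can be sketched roughly as below. This is a hedged illustration only: the class `VoiceCommander`, its methods, and the program and command names are all hypothetical and do not come from the patent.

```python
class VoiceCommander:
    """Maps recognized words to commands and forwards them to programs."""

    def __init__(self):
        self._programs = {}   # program name -> callable that accepts a command
        self._commands = {}   # recognized word -> (program name, command)

    def register_program(self, name, handler):
        self._programs[name] = handler

    def register_word(self, word, program, command):
        # Store the command of a specific program for this word.
        self._commands[word] = (program, command)

    def on_recognized(self, word):
        """Send the stored command for `word` to its program, if any."""
        entry = self._commands.get(word)
        if entry is None:
            return None       # word has no associated command
        program, command = entry
        return self._programs[program](command)
```

A caller would register a handler for, say, a still-picture program, bind the word "shoot" to a "capture" command, and then pass each recognized word to `on_recognized`.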



17/3,K/5 (Item 5 from file: 347)

DIALOG(R) File 347: JAPIO

(c) 2008 JPO & JAPIO. All rts. reserv.

06778014 **Image available**
CONTROL METHOD OF NETWORK

PUB. NO.: 2001-005489 [JP 2001005489 A]
PUBLISHED: January 12, 2001 (20010112)
INVENTOR(s): RAPP STEFAN
             SILKE GORONJII
             RALF KONPE
             PETER BUCHNER
             GIRON FRANCK
             HELMUT LUCKE
APPLICANT(s): SONY INTERNATL EUROP GMBH
APPL. NO.: 2000-117217 [JP 2000117217]
FILED: April 13, 2000 (20000413)
PRIORITY: 99107201 [EP 99107201], EP (European Patent Office),
          April 13, 1999 (19990413)

INTL CLASS: G10L-015/22; G10L-015/00; G10L-015/28; H04r^005/00;
H04Q-009/00

ABSTRACT 

PROBLEM TO BE SOLVED: To speedily and flexibly control network devices by
converting received and recognized voice commands into corresponding
user network commands based on all pairs of user network commands included
in...

...of first and second devices and corresponding commands L1 and L2 do not
include the same vocabulary elements. Therefore, L1 ∩ L2 = {} and a received
language L, which is merged for interface description, becomes L = L1 ∪ L2.
In other words, a general document in a voice device 1 is constructed by
combining pairs of commands corresponding to the vocabulary elements
obtained from the device document, which is described by the vocabulary
elements of the first device and the command L1, and the device document
which is described by the vocabulary elements of the second device and
the command L2. In other words, the user's uttered commands are converted
into user network commands based on all pairs of user network commands
included in user command interpretation...
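
The merge L = L1 ∪ L2 under the condition L1 ∩ L2 = {} can be illustrated with a small sketch. It assumes, purely for illustration, that each device language is a mapping from a vocabulary element to a user network command; the function names and the example commands are hypothetical.

```python
def merge_device_languages(l1, l2):
    """Merge two device languages (vocabulary element -> network command).

    The abstract assumes the languages share no vocabulary elements,
    so an overlap is reported instead of being silently overwritten.
    """
    shared = l1.keys() & l2.keys()          # L1 ∩ L2
    if shared:
        raise ValueError(f"vocabulary overlap: {sorted(shared)}")
    merged = dict(l1)                       # L = L1 ∪ L2
    merged.update(l2)
    return merged


def interpret(merged, utterance):
    """Convert a recognized utterance into a user network command, if known."""
    return merged.get(utterance)
```

Raising on overlap is a design choice of this sketch; the abstract only states that the two vocabularies are disjoint.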



17/3,K/6 (Item 6 from file: 347)

DIALOG(R) File 347: JAPIO

(c) 2008 JPO & JAPIO. All rts. reserv.

06614467 **Image available**
TRANSLATING DEVICE AND RECORDING MEDIUM

PUB. NO.: 2000-200275 [JP 2000200275 A]
PUBLISHED: July 18, 2000 (20000718)
INVENTOR(s): KOIZUMI ATSUKO
             KAJI HIROYUKI
             OBUCHI YASUNARI
             KITAHARA YOSHINORI
APPLICANT(s): HITACHI LTD
APPL. NO.: 11-001669 [JP 991669]
FILED: January 07, 1999 (19990107)

INTL CLASS: G06F-017/28; G10L-013/00; G10L-015/00; G10L-015/22



ABSTRACT 

...of an example sentence retrieval type and an arbitrary sentence input
type translation device by recognizing an input voice, converting
words and phrases included in an example sentence template similar to the
voice according to the voice, and displaying a translated sentence.

SOLUTION: With a power button 103 pressed, a program in a memory is
started and an initial picture appears on a display screen 101. When a
voice is inputted in this state through a microphone 106, key word
voice recognition is performed. When a word is voiced and inputted at
this time, word voice recognition by a speech recognition
algorithm such as the Hidden Markov Model (HMM) is performed by referring
to a word dictionary, and word candidates are displayed on the screen. When
a user voices and inputs a key word, an example sentence template is
retrieved and displayed. When the user speaks the example sentence
template, replacing its replaceable words with other words, which
example sentence template is used and which replacing word is used are
recognized and a translated sentence of the input sentence is displayed and



17/3,K/7
05696892 **Image available**
VOICE INPUT CONTROL APPARATUS

PUB. NO.: 09-311692 [JP 9311692 A]
PUBLISHED: December 02, 1997 (19971202)
INVENTOR(s): MATSUMOTO YOSHIYUKI
APPLICANT(s): HONDA MOTOR CO LTD [000532] (A Japanese Company or
              Corporation), JP (Japan)
APPL. NO.: 08-158766 [JP 96158766]
FILED: May 16, 1996 (19960516)

VOICE INPUT CONTROL APPARATUS

INTL CLASS: G10L-003/00; G10L-003/00; G10L-003/00; G05B-015/02;
G06F-003/16

...JAPIO KEYWORD: Speech Recognition & Synthesis)

ABSTRACT 

PROBLEM TO BE SOLVED: To learn a user's tendency to utter
meaningless voices and habitual expressions, and to deal with that
tendency properly by changing the countermeasure taken when an
unregistered word is received according to the input
frequency of unregistered words which cannot be distinguished as
voice...

...SOLUTION: An unregistered word detecting part 11, an unregistered
word memory part 12, an input frequency counting part 13, and an
unregistered word countermeasure controlling part 14 are installed under
the control of a control part 10. The unregistered word detecting part 11
detects that a word uttered by voice is an unregistered word which is
not registered in a voice dictionary 3. The unregistered word memory
part 12 stores the detected unregistered word together with its
characteristic quantity. The input frequency counting part 13 counts the
input frequency for every received unregistered word and stores the
updated result. The unregistered word countermeasure controlling
part 14 changes the countermeasure taken when the unregistered word
is received.
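
The frequency-dependent countermeasure described above can be sketched as follows. This is a minimal sketch under stated assumptions: the threshold, the three response names ("execute", "reprompt", "ignore"), and the class name are hypothetical, and the characteristic quantity stored by memory part 12 is omitted.

```python
from collections import Counter


class UnregisteredWordTracker:
    """Counts unregistered words and changes the response by input frequency."""

    def __init__(self, dictionary, threshold=3):
        self.dictionary = set(dictionary)   # registered voice dictionary
        self.threshold = threshold          # assumed cut-off for habitual words
        self.counts = Counter()             # input frequency per unregistered word

    def handle(self, word):
        if word in self.dictionary:
            return "execute"                # registered word: act on it
        self.counts[word] += 1              # update the stored frequency
        if self.counts[word] >= self.threshold:
            return "ignore"                 # frequent: treat as a speaking habit
        return "reprompt"                   # rare: ask the user to repeat
```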



17/3,K/8 (Item 1 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.
0014131590

WPI ACC NO: 2004-316222/200429
XRPX Acc No: N2004-251924
Voice recognition device has dictionary expansion unit for adding sound
unit sequence formed by imaging unit, unit for selecting sound unit
sequence from number of sequences using selection criteria
Patent Assignee: INFINEON TECHNOLOGIES AG (INFN)
Inventor: KUESTNER M; ROEMER R
Patent Family (1 patents, 28 countries)
Patent                          Application
Number         Kind  Date       Number         Kind  Date      Update
WO 2004029931  A1    20040408   WO 2003DE3160  A     20030923  200429 B

Priority Applications (no., kind, date): DE 10244168 A 20020923

Patent Details
Number         Kind  Lan  Pg   Dwg  Filing Notes
WO 2004029931  A1    DE   104  23

10501857 Language Recognizer And Operating Method Therefor

National Designated States, Original: CN US
Regional Designated States, Original: AT BE BG CH CY CZ DE DK EE ES FI FR
GB GR HU IE IT LU MC NL PT RO SE SI SK TR
Voice recognition device has dictionary expansion unit for adding sound
unit sequence formed by imaging unit, unit...

Original Titles:

...VOICE RECOGNITION DEVICE, CONTROL DEVICE AND METHOD FOR
COMPUTER-ASSISTED COMPLETION OF AN ELECTRONIC DICTIONARY FOR A VOICE
RECOGNITION DEVICE

Alerting Abstract ...NOVELTY - The device has a unit for
speaker-independent voice recognition, an electronic dictionary of
words for speaker-independent voice recognition, an imaging unit for
forming spoken voice signals on a sequence of sound units, a dictionary
expansion unit for adding a sound unit sequence formed by the imaging unit
and a...

DESCRIPTION - The voice recognition device has a voice recognition
unit for speaker-independent voice recognition, an electronic
dictionary of words for speaker-independent voice recognition, a
voice signal imaging unit for forming spoken voice signals on a sequence
of sound units, a dictionary expansion unit for adding a sequence of
sound units formed by the imaging unit and...

...a controller for controlling a technical system with an inventive
voice recognition device, a telecommunications device with an
inventive controller and a method of computer-aided expansion of an
electronic dictionary for a voice recognition unit.

...USE - For voice recognition.

...ADVANTAGE - Guarantees speaker-dependent and speaker-independent voice
recognition in a simple manner.

Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version

G10L-0015/06 ...
...G10L-0015/18 ...
...G10L-0015/22
G10L-0015/00 ...

Original Publication Data by Authority

Original Abstracts:

...the invention, a sequence of sound units is formed from at least one
voice signal uttered by a user, said sequence representing said voice
signal. The sequence of sound units is formed in such a way that they
can be processed by a voice recognition unit as part of a speaker
independent voice recognition process. The sequence of sound signals
thus formed is added to an electronic dictionary which contains
entries for speaker independent voice recognition.



17/3,K/9 (Item 2 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.
0013566602 - Drawing available
WPI ACC NO: 2003-660878/200362
XRPX Acc No: N2003-527162

Speech recognition method for use in e.g. voice-dialing application,
involves dynamically updating stored recognition vocabulary for
subsequent utterances, based on comparison of received utterance and
recognition vocabulary
Patent Assignee: GUPTA S K (GUPT-I)
Inventor: GUPTA S K

Patent Family (1 patents, 1 countries)
Patent                            Application
Number          Kind  Date        Number        Kind  Date      Update
US 20030120493  A1    20030626    US 200127580  A     20011221  200362 B

Priority Applications (no., kind, date): US 200127580 A 20011221

Patent Details
Number          Kind  Lan  Pg  Dwg  Filing Notes
US 20030120493  A1    EN   8   2

Speech recognition method for use in e.g. voice-dialing application,
involves dynamically updating stored recognition vocabulary for
subsequent utterances, based on comparison of received utterance and
recognition vocabulary

Alerting Abstract ...NOVELTY - Received utterance is compared with
stored recognition vocabulary representing currently active vocabulary.
The stored recognition vocabulary is dynamically updated for subsequent
utterances, based on the comparison result.... speech recognition
system; method of customizing recognition vocabulary...

...USE - For voice-dialing and command/control applications in
client device.

...ADVANTAGE - Allows any upgrade to the voice recognition features on
the client device, without requiring equipment change. Provides the
ability to dynamically change the current active vocabulary and to...

...DESCRIPTION OF DRAWINGS - The figure shows the speech recognition
system

Class Codes

International Classification (Main): G10L-021/00
(Additional/Secondary): G10L-011/00

Original Publication Data by Authority

Original Abstracts:

...system includes a client device in communication with a server. The
client device receives an input speech utterance in a voice
dialog via an input device from a user of the system. The client device
includes a speech recognition engine that compares the received input
speech to stored recognition vocabulary representing a currently active
vocabulary. The speech recognition engine recognizes the
received utterance, and an application dynamically updates the
recognition vocabulary. The dynamic update of the active vocabulary
can also be initiated from the server, depending upon the client
application being run at the client device. The server generates a
result that is sent to the client device via a suitable communication path.
The client application also provides the ability to customize voice-
activated commands in the recognition vocabulary related to common
client device functions, by using a speaker-training feature of the
speech recognition engine.
Claims:

What is claimed is: 1. A method of recognizing speech so as
to modify a currently active vocabulary, comprising: receiving an
utterance; comparing said received utterance to a stored recognition
vocabulary representing a currently active vocabulary; and dynamically
updating the stored recognition vocabulary for subsequent received
utterances based on said comparison.



17/3,K/10 (Item 3 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0013191855 - Drawing available
WPI ACC NO: 2003-275516/200327
XRPX Acc No: N2003-218762

Voice recognition system for wireless terminal, generates digital
signal from received analog speech signal, and updates voice recognition
templates in database based on digital signal

Patent Assignee: CHANG C (CHAN-I); MALAYATH N (MALA-I); QUALCOMM INC
(QUAL-N)
Inventor: CHANG C; MALAYATH N
Patent Family (4 patents, 99 countries)
Patent                            Application
Number          Kind  Date        Number          Kind  Date      Update
US 20020178004  A1    20021128    US 2001864059   A     20010523  200327
WO 2002095729   A1    20021128    WO 2002US16104  A     20020521  200327
TW 557443       A     20031011    TW 2002110885   A     20020523  200424
AU 2002305661   A1    20021203    AU 2002305661   A     20020521  200452

Priority Applications (no., kind, date): US 2001864059 A 20010523

Patent Details
Number          Kind  Lan  Pg  Filing Notes
US 20020178004  A1    EN   15
WO 2002095729   A1    EN
National Designated States, Original: AE AG AL AM AT AU AZ BA BB BG BR BY
BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID
IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ
NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VN
YU ZA ZM ZW
Regional Designated States, Original: AT BE CH CY DE DK EA ES FI FR GB GH
GM GR IE IT KE LS LU MC MW MZ NL OA PT SD SE SL SZ TR TZ UG ZM ZW
TW 557443       A     ZH
AU 2002305661   A1    EN       Based on OPI patent WO 2002095729

Voice recognition system for wireless terminal, generates digital
signal from received analog speech signal, and updates voice recognition
templates in database based on digital signal

Original Titles:

METHOD AND APPARATUS FOR ADAPTING VOICE RECOGNITION TEMPLATES..

...Method and apparatus for voice recognition

...METHOD AND APPARATUS FOR ADAPTING VOICE RECOGNITION TEMPLATES

Alerting Abstract ...NOVELTY - A voice recognition system (20)
comprises a speech processor (24) to receive an analog speech signal
and to generate a digital signal. A storage unit (30) stores the digital
signal and updates the voice recognition templates in a database (22)
based on the digital signal.... method for voice recognition in
wireless communication device; wireless apparatus; and handwriting
recognition system..

...USE - For speech recognition wireless terminal, for accessing
information services such as e-mail, stock quotes, weather updates,
travel advisories, company news, movie schedules, local news, sports...

...DESCRIPTION OF DRAWINGS - The figure shows the block diagram of the
voice recognition system

20 voice recognition system

...24 speech processor

Class Codes

International Classification (Main): G10L-015/00 ...
...G10L-015/06 ...
...G10L-017/00

Original Publication Data by Authority

Original Abstracts:

A voice recognition system applies user inputs to adapt speaker-
dependent voice recognition templates using implicit user
confirmation during a transaction. In one embodiment, the user confirms the
vocabulary word to complete a transaction, such as entry of a
password, and in response a template database is updated. User
utterances are used to generate test templates that are compared to the
template database. Scores are generated for each test...

...A voice recognition system applies user inputs to adapt speaker-
dependent voice recognition templates using implicit user
confirmation during a transaction. In one embodiment, the user
confirms the vocabulary word to complete a transaction, such as entry
of a password, and in response a template database is updated. User
utterances are used to generate test templates that are compared to the
template database. Scores are generated for each test template and a winner
selected...
Claims:

What is claimed is: 1. A voice recognition system comprising:
a speech processor operative to receive an analog speech signal
and generate a digital signal; a database operative to store voice
recognition templates; and a memory storage unit coupled to the
speech processor and the database, the memory storage unit operative
to store the digital signal, the memory storage unit operative to
update the voice recognition templates based on the digital signal and
an implicit user confirmation.



17/3,K/11 (Item 4 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0012847367 - Drawing available
WPI ACC NO: 2002-705880/200276
XRPX Acc No: N2002-556465

Continuous audio stream processing method for speech recognition
systems, involves performing speaker recognition upon speaker change
detection, and transcribing portion of audio stream if predetermined
speaker is recognized

Patent Assignee: IBM CORP (IBMC)
Inventor: FRANK J; KRIECHBAUM W; STENZEL G
Patent Family (1 patents, 1 countries)
Patent                            Application
Number          Kind  Date        Number         Kind  Date      Update
US 20020091517  A1    20020711    US 2001997957  A     20011130  200276 B

Priority Applications (no., kind, date): EP 2000127335 A 20001130

Patent Details
Number          Kind  Lan  Pg  Dwg  Filing Notes
US 20020091517  A1    EN   9   4

Continuous audio stream processing method for speech recognition
systems, involves performing speaker recognition upon speaker change
detection, and transcribing portion of audio stream..

Alerting Abstract ...Continuous audio stream processing apparatus;
Speech recognition or voice control system; Data processing
program for continuous audio stream processing; and Computer program
product for continuous audio stream processing...

...USE - For processing continuous audio streams in speech recognition
or voice control system (claimed) for handling commercial or
business transaction related telephone conversations, automatic
transaction systems in tele shopping, tele banking, multi-user speech
recognition and voice activation control systems for personal
meetings for use in vehicles such as aircraft cockpits and for call
center environments...

Class Codes

International Classification (Main): G10L-015/00
Original Publication Data by Authority

Original Abstracts:

Disclosed are a method and apparatus for processing a continuous audio
stream containing human speech in order to locate a particular
speech-based transaction in the audio stream applying both known speaker
recognition and speech recognition techniques. Hereby it is
enabled that only the utterances of a particular predetermined speaker
are transcribed thus providing an index and a summary of the underlying
dialogue(s...

...a second scenario, two or more speakers located in one room are using a
multi-user speech recognition system (SRS). For each user there
exists a different speaker model and optionally a different
dictionary or vocabulary of words already known or trained by the speech
or voice recognition system. In such an environment, the invention
allows to switch between different dictionaries when a first user has
stopped utterance and a second user is going to start his utterance.

Claims:

1. A method of processing a continuous audio stream containing
human speech related to at least one particular transaction, comprising
the steps of: digitizing the continuous audio...



17/3,K/12 (Item 5 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.
0012839181 - Drawing available
WPI ACC NO: 2002-697555/200275
XRPX Acc No: N2002-550021

Automated speech recognition method in computer system involves
accessing voice-mail, caller-identity features corresponding to spoken
utterance, if spoken utterance has specific grammar

Patent Assignee: LUCENT TECHNOLOGIES INC (LUCE)
Inventor: AUGUST K G
Patent Family (1 patents, 1 countries)
Patent                            Application
Number          Kind  Date        Number         Kind  Date      Update
US 20020094067  A1    20020718    US 2001764708  A     20010118  200275 B

Priority Applications (no., kind, date): US 2001764708 A 20010118

Patent Details
Number          Kind  Lan  Pg  Dwg  Filing Notes
US 20020094067  A1    EN   32  14

Automated speech recognition method in computer system involves
accessing voice-mail, caller-identity features corresponding to spoken
utterance, if spoken utterance has specific grammar

Original Titles:

Network provided information using text-to-speech and speech
recognition and text or speech activated network control sequences for
complimentary feature access

Alerting Abstract ...state database comprising tree structure with nodes
representing particular system state is maintained. A spoken utterance of
a subscriber at a particular system state is recognized by comparing
with grammar for respective node corresponding to the system state. The
voice-mail, caller-identity features corresponding to spoken utterance
are accessed, if spoken utterance has specific grammar. ...USE - For
recognizing speech of user such as verbal information verification,
word pronunciation in computer system, mobile telephone, personal digital
assistant, telematics for vehicle, ear bud...

...ADVANTAGE - The system can compare the spoken utterance with limited
number of words available for system state, thereby simplifying speech
recognition task. Enhances functionality of the speech recognition by
using subscription-type service which provides voice model to the user. The
access restriction prevents the user from accidentally triggering voice
command system while engaged in a conversation and increases amount of
grammar to be used at...

...DESCRIPTION OF DRAWINGS - The figure shows the block diagram of the
speech recognition system

Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version
G10L-0013/04 ...
...G10L-0015/26
G10L-0013/00 ...
...G10L-0015/00

Original Publication Data by Authority

Original Abstracts:

...permits the subscriber to manage incoming and existing calls through
available features accessed using spoken utterances. A speech
processing unit coupled to the system interprets a subscriber's
spoken utterances without requiring the subscriber to train the system
to recognize his or her voice. The interpretation of spoken utterances
is enabled by a system state database that is maintained at the speech
processing unit and comprises a database of the possible system
states, including possible call flows for a call, and a database associated
with the system state database comprising context-specific grammar that a
subscriber may recite at respective points in the call flow. The speech
processing unit may also convert message signals from the network to
speech which is read to the subscriber using a...

...to speech translator. The network can identify the voice or subscriber 
voice, or language used and will thereafter recognize all further 
corrmands using specific grarrrrar for that language as well as perform 
t ext - 1 o- speech conversion using the identified language. Use of the... 
Claims:

...permitting a subscriber to perform an action available on a
communications network using a spoken utterance, comprising: maintaining
a system state database comprising a tree structure having a plurality of
nodes, each respective node of said plurality of nodes representing...

...of a plurality of possible system states and being associated with a
predetermined node-specific grammar for the respective node; awaiting
from the subscriber a spoken utterance at the particular system state;
recognizing the spoken utterance by comparing the spoken utterance to
the predetermined grammar for the respective node for correspondence to
the particular system state; and performing an action at the network
represented by the spoken utterance if the spoken utterance has been
recognized as the predetermined grammar for the respective node.
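
The claimed tree of system states with node-specific grammars can be illustrated with a short sketch. It is a hedged illustration rather than the patented system: it assumes a grammar is a plain mapping from a normalized utterance to an action name, and `StateNode`, `step`, and the example actions are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class StateNode:
    """One system state with the grammar that is valid only in that state."""
    name: str
    grammar: Dict[str, str] = field(default_factory=dict)            # utterance -> action
    children: Dict[str, "StateNode"] = field(default_factory=dict)   # action -> next state


def step(node: StateNode, utterance: str) -> Tuple[StateNode, Optional[str]]:
    """Match the utterance against the current node's grammar only.

    Returns the next state and the action to perform, or the same state
    and None when the utterance is not in this node's grammar.
    """
    action = node.grammar.get(utterance.strip().lower())
    if action is None:
        return node, None
    return node.children.get(action, node), action
```

Restricting the comparison to the current node's grammar is what keeps the candidate set small at each state, which is the simplification the Alerting Abstract credits to this approach.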



17/3,K/13 (Item 6 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0010974490 - Drawing available
WPI ACC NO: 2001-598427/200168
XRPX Acc No: N2001-446299

Designing voice-activated user interface used in a computer system
involves jointly optimizing selected vocabulary set and prompting syntax
based on second testing of subjects from target community

Patent Assignee: COMVERSE INC (COMV-N); COMVERSE NETWORK SYSTEM INC
(COMV-N)
Inventor: YUSCHIK M J
Patent Family (12 patents, 28 countries)
Priority Applications (no., kind, date): US 1999456922 A 19991207; US
20022161... A 20020813; US 2002216811 ... 2002216839 A 20020813

Patent Details

Number Kind Lan
EP 1107229 A2
... Division of application ...
... Division of application ...
... Division of application US 1999456922
... Division of application IL 71200
    Division of patent IL 140146
US 7139706 B2 EN Division of application US 1999456922
    Division of patent US 6526382
Designing voice-activated user interface used in a computer system
involves jointly optimizing selected vocabulary set and prompting...

Original Titles:

Language-oriented user interfaces for voice activated services...

System and method of developing automatic speech recognition
vocabulary for voice activated services

Alerting Abstract ...the computer system using the voice-activated
user interface; and the computer readable medium encoded with the program
for voice-activated user interface...

...ADVANTAGE - Designs language-oriented user interfaces for voice
activated services. Collects responses to task-oriented questions eliciting
commonly used names for tasks and task...
...temporal syntax. Changes command and syntax parameter of user interface
based on results of testing user interface with speakers of target
language. Detects errors occurring in dialogue between user and user
interface based on...

...DESCRIPTION OF DRAWINGS - The figure shows an overview flow diagram of
designing voice-activated user interface.

Class Codes

International Classification (Main): G10L-015/06
International Classification (+ Attributes)
IPC + Level Value Position Status Version
G10L-0015/06 ...
...G10L-0015/06 ...
...G10L-0015/18 ...
...G10L-0015/22
G10L-0015/00 ...
...G10L-0015/00

Original Publication Data by Authority

Original Abstracts:

A comprehensive system is provided for designing a voice activated user
interface (VA UI) having a semantic and syntactic structure adapted to the
culture and...

...is general across languages and encompasses universal variables of
language and culture. Also provided are voice activated user interfaces
with semantic and syntactic structures so adapted, as well as a prompting
grammar...

...is general across languages and encompasses universal variables of
language and culture. Also provided are voice activated user
interfaces. Specifically, a prompting syntax is defined with syntax
parameters (such as pace, pause...

...hypothetical task to be performed; asks each of the at least one
respondent for a word that the respondent would use to command the
hypothetical task to be performed; receives, from each of the at least one
respondent, a command word; develops a list of command words from the
received command word; and rejects the received command word, if the
received command word is acoustically similar to another word in the
list of command words. The approach is general across languages and
encompasses universal variables of language and culture. Also provided are
prompting grammar and error handling methods adapted to such user
interfaces.

Claims:

A method for designing a voice activated user interface, the method
comprising: separately selecting a vocabulary set and a prompting syntax
for...

...What is claimed is: 1. A method for optimizing a voice
activated user interface, the method comprising: configuring the user
interface with a vocabulary of command words including at least one
word indicating a corresponding task and selected from plural words for
the task based on frequency of use; and changing at least one of a command
and a syntax parameter of the user interface based on results of testing
the user interface with speakers of a target language...

...What is claimed is: 1. A method for developing an automatic
speech recognition (ASR) vocabulary for a voice activated service,
the method comprising: a. posing, to at least one respondent, a hypothetical
task to be performed; b. asking each of the at least one respondent for a
word that the respondent would use to command the hypothetical task to be
performed; c. receiving, from each of the at least one respondent, a command
word; d. developing a list of command words from the received command
word; e. rejecting the received command word, if the received command
word is acoustically similar to another word in the list of command
words.
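
Steps c to e of this claim can be illustrated with a minimal Python sketch.
It is not the patented method: the record does not say how acoustic
similarity is measured, so `sounds_similar` is a hypothetical stand-in that
compares spellings with `difflib.SequenceMatcher`, and the 0.7 threshold and
the "most frequently offered word first" ordering are assumptions made only
for illustration.

```python
from collections import Counter
from difflib import SequenceMatcher


def sounds_similar(a: str, b: str, threshold: float = 0.7) -> bool:
    """Hypothetical proxy for acoustic similarity: spelling overlap ratio."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold


def build_command_list(responses, threshold: float = 0.7):
    """Grow a command word list, rejecting words confusable with kept ones.

    responses: words offered by respondents for one hypothetical task.
    Returns (accepted, rejected) lists of lower-cased words.
    """
    accepted, rejected = [], []
    # Assumption: consider the most frequently offered words first.
    for word, _count in Counter(w.lower() for w in responses).most_common():
        if any(sounds_similar(word, kept, threshold) for kept in accepted):
            rejected.append(word)  # step e: too close to a listed word
        else:
            accepted.append(word)  # step d: extend the command list
    return accepted, rejected
```

With the responses "call", "dial", "call", "ball", "phone", "dial", "call",
this sketch keeps "call", "dial" and "phone" and rejects "ball" as too close
to "call". A real system would replace the spelling proxy with a comparison
of phonetic or acoustic representations.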



...What is claimed is: 1. A method for adaptive error handling in a
voice activated user interface, the method comprising: detecting that an
error has occurred in a dialogue between...

...What is claimed is: 1. A method for prompting a user of a voice
activated user interface, the method comprising: pausing for a first
predetermined interval after presentation of a label identifying a current
menu state of the user interface; and presenting to the user a command
option for the current menu state only...

...What is claimed is: 1. A method for selecting a vocabulary set for
a voice activated user interface, the method comprising: collecting
responses to task-oriented questions eliciting commonly used names...

...method for designing a voice activated user interface, the method
comprising: separately selecting a vocabulary set and a prompting syntax
for...



...What is claimed is: 1. A method of developing an automatic speech
recognition (ASR) vocabulary for a voice activated service,
comprising: a. posing, to at least one respondent, a hypothetical task to
be performed; b. asking each of the at least one respondent for a word
that the respondent would use to command the hypothetical task to be
performed; c. receiving, from each of the at least one respondent, a command
word; d. developing a list of command words from the received command
word; e. rejecting the received command word, if the received command
word is acoustically similar to another word in the list of command
words; and f. selecting for the list of command words a command word
that is both semantically equivalent to the received command word and
relatively frequently received, when the received command word is
rejected.

Priority Applications (no., kind, date): US 1997899952 A 19970724
Patent Details

Number Kind Lan Pg Dwg Filing Notes

US 6246989 B1 EN 11 7

Alerting Abstract ...Method for providing functions to the user; Variable
state transaction system; Variable state transaction system for
arranging functions for presentation to an user based on user's device



Class Codes
G10L-0015/22 ...

...G10L-0015/26

Original Abstracts:

...only a small number of function choices. The system uses a data base
having key words that describe each function, and permutation lists that
have expanded lists of phrases that are synonymous with the key words.
The system also may use a thesaurus or lexicon application to expand the
recognition of key words. The system uses a dialogue engine to compare
the commands (e.g., voice input) from the user with the key words
and permutation lists to identify the requested function.

Regional Designated States, Original: AT BE CH CY DE DK ES FI FR GB GR IE

Based on OPI patent WO 2001067435

...has input of reference vocabulary by each user used by speech
recognition device for allowing speech-controlled operation by several
users

Original Titles:

...METHOD FOR THE VOICE-CONTROLLED INITIATION OF ACTIONS BY MEANS OF A
LIMITED CIRCLE OF USERS, WHEREBY SAID ACTIONS CAN...

...Method for the voice-controlled initiation of actions by means of a
limited circle of users, whereby said actions can...

Alerting Abstract ...method uses a training phase for entering a limited
number of spoken commands in a speech recognition device by the user,
each corresponding to a required action, with subsequent analysis of the...

...The speech recognition device can be used for operation of the
device by more than one user by...
Class Codes

International Classification (Main): G10L-015/06
International Classification (+ Attributes)
IPC + Level Value Position Status Version

G10L-0015/06 ...

G10L-0015/00 ...

Original Publication Data by Authority

Original Abstracts:

...voice is detected on the basis of a speaker-dependent voice detection
system in a user-independent manner and without user identification.
The reference voice patterns of all users pertaining to a voice detection
system are allocated to detection voice expressions, e.g. the words of a
vocabulary, of the users pertaining to the circle of users, whereby
said patterns are required for detection...

...is to control initiation of actions in a user-independent manner and by
means of voice and users pertaining to a limited circle of users of an
appliance, whereby said actions can...

...be carried out in the appliance. The voice is detected on the basis of a
speaker-dependent voice detection system in a user-independent manner
and without user identification. The reference voice patterns of all users
pertaining to a voice detection system are allocated to detection voice
expressions, e.g. the words of a vocabulary, of the users pertaining to
the circle of users, whereby said patterns are required for detection.

Claims:

...the user enters into the device, for at least one action, at least one
reference speech utterance to be allocated to that action, 2. a reference
speech pattern is generated from the reference speech utterance by
speech analysis, whereby, given several reference speech utterances, the
reference speech pattern is generated...

...1. Method for the voice-controlled initiation of actions by
means of a limited circle of users, whereby said actions can be carried out
in an appliance, comprising the following features: (a) On the basis of the
voice pertaining to at least one user of the user circle of the device,
the device...

...the users, with respect to at least one action, enters at least one
reference speech utterance into the device, whereby said reference speech
utterance is respectively allocated to the action, (a2) a reference speech
pattern is generated from the reference speech utterance by speech
analysis, whereby the reference speech pattern, given a plurality of
reference speech utterances, is generated when the reference speech
utterances are similar, (a3) the reference speech pattern is allocated to
the action, (a4) the reference speech pattern is unconditionally stored
with the allocated action or is only stored when the reference speech
pattern is not similar to the already stored other reference speech
patterns which are allocated to other actions, (b) the respective user, in a
voice recognition phase, enters a recognition speech utterance
into the device for the operating mode of the device selected by the
user, (c) a recognition speech pattern is generated from the
recognition speech utterance by speech analysis, (d) the recognition
voice pattern is compared to at least a part of the reference
speech patterns, which are stored for the selected operating mode, such
that the similarity between the respective reference speech pattern and
the recognition speech pattern is detected and such that a
similarity rule of precedence of the stored reference speech patterns
is formed on the basis of the detected similarity values, (e) the
voice-controlled initiation of the action to be carried out in the device
by the user, whereby said voice-controlled initiation is caused by the
recognition voice utterance, is admissible when the recognition
speech pattern is similar to the reference speech pattern which is first
in the similarity rule of precedence or when the recognition speech
pattern is similar to the reference speech pattern which is first in the
similarity rule of precedence and when said recognition speech
pattern is not similar to the reference speech pattern situated at
the n-th position in the similarity rule of precedence, whereby another
action is allocated to the reference speech pattern situated at the
n-th position in the similarity rule of precedence than to the action that
is allocated to the reference speech pattern which is first in the
similarity rule of precedence and whereby the reference speech...

...pattern situated first in the similarity rule of precedence, is only
carried out when the recognition voice utterance, in a speech
recognition phase, entered by the user into the device for the operating
mode of the device
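
Features (d) and (e) of this claim describe a ranking of stored reference
patterns followed by an admissibility test. The Python sketch below is one
possible reading, not the claimed implementation: the `Reference` type, the
cosine `similarity` measure, the 0.9 threshold and the interpretation of the
"n-th position" as the best-ranked pattern allocated to a different action
are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class Reference:
    action: str
    pattern: tuple  # hypothetical feature vector produced by speech analysis


def similarity(a, b) -> float:
    """Cosine similarity; an assumed measure, since the claim names none."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def decide(recognition, references, threshold=0.9, strict=True):
    """Return the action to initiate, or None if initiation is inadmissible."""
    # Feature (d): order reference patterns by similarity, best first.
    ranked = sorted(
        references,
        key=lambda ref: similarity(recognition, ref.pattern),
        reverse=True,
    )
    if not ranked or similarity(recognition, ranked[0].pattern) < threshold:
        return None
    if strict:
        # Feature (e), second alternative: the utterance must not also be
        # similar to the best-ranked pattern that belongs to another action.
        for other in ranked[1:]:
            if other.action != ranked[0].action:
                if similarity(recognition, other.pattern) >= threshold:
                    return None
                break
    return ranked[0].action
```

With `strict=False` the sketch follows the first alternative of feature (e)
and accepts the top-ranked pattern alone; with `strict=True` an utterance
that is also close to a pattern for a different action is refused.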
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Conrput er i zed speech recognition procedure involves creating vocabulary 
states, associated with program unit by pattern matching of acoustic word
signals for each user input

Patent Assignee: DRAGON SYSTEMS INC (DRAG-N)
Inventor: HEITMAN P S; MCGRATH F J; PARKE J W; SQUIRES S D
Patent Family (1 patents, 1 countries)



Patent                          Application
Number      Kind  Date      Number         Kind  Date      Update
US 6092043  A     20000718  US 1992976413  A     19921113  200061
                            US 1995382752  A     19950201
                            US 1997882920  A     19970626

Priority Applications (no., kind, date): US 1992976413 A 19921113; US
1995382752 A 19950201; US 1997882920 A 19970626

Patent Details

Number Kind Lan Pg Dwg Filing Notes

US 6092043 A EN 127 81 C-I-P of application US 1992976413
                       Division of application US 1995382752
                       C-I-P of patent US 5428707

Computerized speech recognition procedure involves creating vocabulary
states, associated with program unit by pattern matching of acoustic word
signals for each user input

Original Titles:

Apparatuses and method for training and operating speech recognition
systems.

Alerting Abstract ...NOVELTY - The vocabulary states with words are
created for program unit when acoustic word signals are input by user.
Active vocabulary states are set based on pattern matching to determine
probable vocabulary states for word signals. When vocabulary states
having input focus for currently running program exists, they are added to
active set or else new set of vocabulary states are created. USE - For
operation of systems that recognize acoustic word signal...

...ADVANTAGE - Since vocabulary states are recognized for acoustic
signals, word recognition is convenient and easy...

...DESCRIPTION OF DRAWINGS - The figure shows the device used for speech
recognition.

Title Terms.../Index Terms/Additional Words: WORD;
Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version
...G10L-0015/06 ...

...G10L-0015/18 ...

G10L-0015/22 ...

...G10L-0015/26
...G10L-0015/00

Original Publication Data by Authority

Original Abstracts:

A word recognition system can: respond to the input of a character string
from a user by limiting the words it will recognize to words having a
related, but not necessarily the same, string, score signals generated
after a user has been prompted to generate a given word against words
other than the prompted word to determine if the signal should be used to
train the prompted word; vary the number of signals a user is prompted to
generate to train a given word as a function of how well the training
signals score against each other or prior models for the prompted word;
create a new acoustic model of a phrase by concatenating prior acoustic
models of the words in the phrase; obtain information from another
program running on the same computer, such as...

...the context of text being entered into it, and use that information to
vary which words it can recognize; determine which program unit, such as
an application program or dialog box, currently has input focus on its
computer and create a vocabulary state associated with that program unit
into which vocabulary words which will be made active when that program
group has the focus can be put...

...and alter the instructions it executes in response; test if its ability
to respond to voice input has been shut off without user confirmation,
and, if so, turn that ability back on...

...be turned off; store both a first and a second set of models for
individual vocabulary words and enable a user to selectively cause the
recognizer to disregard the second set of models for a selected word;
and/or score a signal representing a given word against models for that
word from different word model sets to select which model should be used
for future recognition.

Claims:

A computerized method of performing word recognition designed to
operate on a computer system which can run multiple program units at
one time and which can selectively give one of those program units an input
focus...

...to receive certain user input, said computerized method comprising the
steps of: receiving user generated word signals representing words to be
recognized; storing a plurality of vocabulary words, and storing a
plurality of vocabulary states with which one or more vocabulary
words can be associated; enabling a user to selectively add one or
more vocabulary words to a vocabulary state; defining a set of
one or more vocabulary states which are active at a given time;
associating each of one or more of said vocabulary states with a
corresponding ones of said program unit; performing pattern matching upon
the word signals to select which one or more of a plurality of
vocabulary words associated with the set of currently active
vocabulary states most probably correspond to each such word signal;
determining which program unit, if any, running on said computer system
currently has the input focus; and determining whether or not the program
unit having the input focus has a vocabulary state associated with it,
and if so adding that vocabulary state to the set of active vocabulary
states, and if not creating a new vocabulary...
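
The focus-tracking steps at the end of this claim can be sketched as a small
Python class. This is an illustration under assumptions, not the patented
design: the `Recognizer` class and its method names are hypothetical, program
units are identified by plain strings, and because the claim text is
truncated after "creating a new vocabulary...", activating the newly created
state is a guess.

```python
class Recognizer:
    """Toy model of vocabulary states keyed by program unit."""

    def __init__(self):
        self.states = {}  # program unit name -> set of vocabulary words
        self.active = []  # names of the currently active vocabulary states

    def add_word(self, unit: str, word: str) -> None:
        """Let the user add a vocabulary word to a unit's vocabulary state."""
        self.states.setdefault(unit, set()).add(word)

    def on_focus(self, unit: str) -> bool:
        """Handle a focus change; return True if a new state was created."""
        created = unit not in self.states
        if created:
            self.states[unit] = set()  # no state yet: create an empty one
        if unit not in self.active:
            self.active.append(unit)  # assumption: new states become active
        return created

    def candidates(self) -> set:
        """Words that pattern matching would be allowed to select from."""
        words = set()
        for unit in self.active:
            words |= self.states[unit]
        return words
```

A caller would invoke `on_focus` whenever the windowing system reports a new
focused application or dialog box, then restrict pattern matching to
`candidates()`. How states leave the active set is not covered by the
excerpt and is left out here.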
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BR 199913408 A PT 



AE AL AM AT AU AZ BA BB BG BR BY CA
GB GD GE GH GM HR HU ID IL IN IS JP KE
UZ VN YU ZA ZW

AT BE CH CY DE DK EA ES FI FR GB GH
PT SD SE SL SZ UG ZW

EP 1110207 A2 EN
Regional Designated States, Original: AT BE CH CY DE DK ES FI FR GB GR IE
IT LI LU MC NL PT SE

JP 2002524777
AU 760377
Regional Designated States, Original: AT BE CH CY DE DK ES FI FR GB GR IE
IT LI LU MC NL PT SE
DE 69924853

Based on OPI patent WO 2000014729
PCT Application WO 1999SE1515
Based on OPI patent WO 2000014729
PCT Application WO 1999SE1515
Based on OPI patent WO 2000014729
PCT Application WO 1999SE1515
Based on OPI patent WO 2000014729
Previously issued patent AU 9958944
Based on OPI patent WO 2000014729
PCT Application WO 1999SE1515
Based on OPI patent WO 2000014729
Application EP 1999946549
PCT Application WO 1999SE1515
Based on OPI patent EP 1110207
Based on OPI patent WO 2000014729



Speech recognition system for accurate voice dialling in mobile
telephone

Alerting Abstract ...NOVELTY - The speech recognition system has
words in a vocabulary arranged in a trellis structure comprising a
number of groups of words. A limited number of groups of the entire
vocabulary is searched for a word at each time. The vocabulary is
arranged in a tree structure. The system includes an element for outputting
the words the system is set to recognize at a particular moment.
DESCRIPTION - INDEPENDENT CLAIMS are included for a speech recognition
method...

...with echo effects. At each instant system only needs to search limited
part of entire vocabulary. Solves problem of having to search many words
at a time, thus saving time and computational load on system...

...DESCRIPTION OF DRAWINGS - The figure shows a block diagram of a speech
recognition system

Class Codes

International Classification (Main): G10L-015/06 ...
...G10L-015/28

International Classification (+ Attributes)
IPC + Level Value Position Status Version
G10L-0015/00 ...

...G10L-0015/06 ...

...G10L-0015/08 ...

...G10L-0015/28
G10L ...

...G10L-0015/00 ...

...G10L-0015/00

Original Publication Data by Authority

Original Abstracts:

In a speech recognition system of a mobile telephone the words
are organized in a trellis structure. Thus, at each instant the speech
recognition system only needs to search a limited part of the entire
vocabulary. Such an arrangement solves the problem of having to search
many words at a time, which is time consuming and imposes a high
computational load on the system and will therefore significantly increase
the accuracy for the speech recognition system.



Claims:

...A speech recognition system for a mobile telephone for recognizing
entered command words, the system comprising: means for analyzing entered
voice signals, means for storing a vocabulary containing a plurality of
words, and means for comparing entered voice signals with words in the
stored vocabulary, characterized in that the words in
the vocabulary are arranged in a tree structure comprising a number of
groups of words in different levels, and that the speech
recognition system is adapted to first recognize a first word entered
into the mobile telephone at one level in the tree structure, and
thereafter search only a group of words under the first word at the
next level in the tree structure when a second word is entered into the
mobile telephone.



...The invention claimed is: 1. A speech recognition system in a mobile
telephone, the speech recognition system comprising: a stored
vocabulary, wherein words in the vocabulary are arranged in a trellis
structure comprising a plurality of different groups of words, and a word
group selection system for enabling a user to speak via voice
commands to select at least a first of said plurality of different groups
of words, said first group of words being selected based upon at
least a word spoken by the user, so that a limited number of groups
of the entire vocabulary, less than said plurality, is searched for a
word during subsequent speech recognition processes in the mobile
telephone after selection of at least the first of said plurality
of groups of words.
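
The effect of the grouped vocabulary can be shown with a short Python sketch
in which each recognized word narrows the set of words searched next. The
nested-dictionary layout, the sample words and the `active_words` helper are
hypothetical; the record does not describe how the groups are stored.

```python
# Hypothetical vocabulary: each key is a word, each value the group of
# words that may follow it at the next level.
VOCABULARY = {
    "call": {"home": {}, "office": {}},
    "message": {"read": {}, "send": {}},
}


def active_words(tree: dict, spoken: list) -> list:
    """Return the only words the recognizer must search after `spoken`."""
    node = tree
    for word in spoken:
        if word not in node:
            raise KeyError(f"{word!r} is not in the active group")
        node = node[word]  # descend to the group under the recognized word
    return sorted(node)
```

For example, `active_words(VOCABULARY, [])` yields the top-level group
("call", "message"), and after "call" has been recognized only "home" and
"office" remain to be searched.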



17/3,K/18 (Item 11 from file: 350)
DIALOG(R) File 350: Derwent WPIX
(c) 2008 The Thomson Corporation. All rts. reserv.

0009765282 - Drawing available
WPI ACC NO: 2000-052130/200004
Related WPI Acc No: 1995-240307; 1999-070054; 1999-403770; 1999-404055;
1999-404056; 1999-456293; 1999-600343; 2000-586103; 2000-636971;
2000-637172
XRPX Acc No: N2000-040655

Voice response confirmation system for speech recognition system
Patent Assignee: DRAGON SYSTEMS INC (DRAG-N)
Inventor: GOULD J M
Patent Family (1 patents, 1 countries)

Patent                          Application
Number      Kind  Date      Number         Kind  Date      Update
US 5983179  A     19991109  US 1992976413  A     19921113  200004 B
                            US 1995382752  A     19950201
                            US 1997882918  A     19970626

Priority Applications (no., kind, date): US 1995382752 A 19950201; US
1992976413 A 19921113; US 1997882918 A 19970626

Patent Details

Number Kind Lan Pg Dwg Filing Notes

US 5983179 A EN 128 89 C-I-P of application US 1992976413
                       Division of application US 1995382752
                       C-I-P of patent US 5428707

Voice response confirmation system for speech recognition system

Original Titles:

Speech recognition system which turns its voice response on for
confirmation when it has been turned off...

Alerting Abstract ...the speech response is to be left in ON or OFF state
is based on word which speech response selects corresponding to user's
confirmation utterance. The speech response is turned OFF by storing
indications which indicates that OFF state has...

DESCRIPTION - The acoustic signals received from a receiver is matched with
models of vocabulary words. A speech-response unit responds to sounds
which appear to match models of spoken words by performing functions
associated with it. A confirmation process is performed if it is detected
that speech response is OFF and no indication is stored corresponding to
OFF state confirmed by user. The user is prompted to utter a phrase
confirming whether or not speech response is to be OFF...

...USE - For speech recognition system...

...ADVANTAGE - Offers speech recognition apparatus by which user can
recognize words easily and accurately. Enables user to correct errors in
word recognition accurately. Improves ability to control speech
recognition system by voice alone. Reduces number of samples of word
an user has to generate to achieve given level of word recognition
performance. Increases ability of word recognition to control or provide
input to other programs running on same computer system as it is. Improves
word recognition by which both specified word models and custom word
models are identified...

Class Codes

International Classification (Main): G10L-007/08
Original Publication Data by Authority

Original Abstracts:

A speech recognition system includes a speech-response capability
for responding to sounds which appear to match models of spoken words by
performing functions associated with such words. This speech response
can be turned on or off. If the system detects both that speech response
is off and that no indication is stored that the off state has been
confirmed by a user, it performs a confirmation process. This prompts
the user to utter a phrase confirming whether or not speech
response is to be off; turns speech response on so it can respond to the
user's confirmation utterance; determines whether to leave speech
response on or off in response to the word which the speech response
selects as corresponding to the user's confirmation utterance; and
responds to such a determination that speech response is to be turned
off, both by turning it off and by storing an indication that the off state
has been confirmed by the user. In some embodiments, the system can
execute user written programs which can include instructions for
turning speech response on or off. In this case, the system can delay the
start of a confirmation process until after the execution of such a
program, to allow such programs to temporarily turn off speech response
without requiring user confirmation. In some embodiments, the user can
select whether to deactivate the feature which causes a confirmation
processes to occur whenever speech response has been turned off without
confirmation.
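
The confirmation process described in this abstract amounts to a small state
machine, sketched below in Python. The `SpeechResponse` class, the `listen`
callback, the prompt text and the "yes"/"no" answers are hypothetical names
chosen for illustration; the delayed start during user-written programs and
the option to disable the feature are omitted.

```python
class SpeechResponse:
    """Toy model of the on/off state and its stored confirmation flag."""

    def __init__(self):
        self.on = True
        self.off_confirmed = False

    def turn_off(self) -> None:
        """Switch speech response off without user confirmation."""
        self.on = False
        self.off_confirmed = False

    def check(self, listen) -> None:
        """Run the confirmation process if the off state is unconfirmed.

        listen: callable taking a prompt string and returning the word the
        recognizer selected for the user's reply (hypothetical interface).
        """
        if self.on or self.off_confirmed:
            return  # nothing to confirm
        self.on = True  # turn on so the confirmation utterance is heard
        answer = listen("Keep speech response off? Say 'yes' or 'no'.")
        if answer == "yes":
            self.on = False
            self.off_confirmed = True  # remember the confirmed off state
```

After a confirmed "yes", later calls to `check` return immediately, which
mirrors the stored indication that the off state has been confirmed. Any
other answer leaves speech response switched on.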
Claims:

A computerized speech recognition system comprising: speech-response
means including: means for receiving acoustic signals
representing sounds received by a microphone; recognition means for pattern
matching said acoustic signals against models of vocabulary words, to
select, based on said pattern matching, which vocabulary words, if any,
most probably correspond to those signals; and means for responding to
a selection of a given vocabulary word as most probably corresponding
to a given acoustic signal by performing a function associated with
that vocabulary word; on-off means for selectively setting the speech
response means to an on or off state in which said speech response
means, respectively, either can...

...or not an off state of said speech response means has been confirmed by
the user; confirmation means for performing a confirmation process
including: first means for prompting the user to utter a phrase of one
or more words confirming whether or not the user wants said speech
response means to be in the off state; second means for causing said
on-off means to set said speech response means to an on state so said
speech response means can respond to an utterance the user makes in
response to said prompt; third means for determining whether to place said
speech response means in said on or off state in response to the
vocabulary word selected by the recognition means as corresponding to an
acoustic signal received after said prompt; fourth means for responding to
such a determination to place said speech response means off, by
causing said on-off means to set said speech response means...

...speech response means being in said off state and said confirmation
information indicating that the off state has not been confirmed by the
user, for initiating said confirmation process in which said first,
second, third, and fourth means each perform their respective function.



17/3,K/19 (Item 12 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0009648092 - Drawing available
WPI ACC NO: 1999-600343/199951

Related WPI Acc No: 1995-240307; 1999-070054; 1999-403770; 1999-404055;
1999-404056; 1999-456293; 2000-052130; 2000-586103; 2000-636971;
2000-637172

XRPX Acc No: N1999-442508

Word correction mechanism for speech recognition system
Patent Assignee: DRAGON SYSTEM INC (DRAG-N)

Inventor: GOULD J M; MCGRATH F J; PARKE J W; SQUIRES S D; STEELE E E
Patent Family (1 patents, 1 countries)
Patent                        Application
Number      Kind  Date        Number         Kind  Date      Update
US 5960394  A     19990928    US 1992976413  A     19921113  199951 B
                              US 1995382752  A     19950201
                              US 1997956498  A     19971022

Priority Applications (no., kind, date): US 1992976413 A 19921113;
US 1995382752 A 19950201; US 1997956498 A 19971022

Patent Details

Number      Kind  Lan  Pg   Dwg  Filing Notes
US 5960394  A     EN   128  89   C-I-P of application US 1992976413
                                 Continuation of application US 1995382752
                                 C-I-P of patent US 5428707

Word correction mechanism for speech recognition system
Original Titles:

Method of speech command recognition with dynamic assignment of
probabilities according to the state of the controlled applications.

Alerting Abstract ...transfers the time varying information from other
application program actually running on the computer to word recognition
unit. A probability alerting unit selects one correct word, from the
selected words, on probability basis, with respect to the received time
varying information. DESCRIPTION - A recognition unit performs pattern
matching of stored vocabulary words corresponding to the word signals
received and selects one or more words. An INDEPENDENT CLAIM is also
included for speech recognition method...

...USE - In speech recognition systems...

...ADVANTAGE - Eases word recognition and offers error-free word
recognition. Improves word recognition performance. Since training sample
of words are reduced. Improves performance and offers hands free
operation. Algebraic filtering is used to limit recognition of words
starting with desired letter, thus improves probability of word
recognition. Useful to handicapped users...

Title Terms/Index Terms/Additional Words: WORD;

Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version
...G10L-0015/06...

...G10L-0015/18...

...G10L-0015/22...

...G10L-0015/26...

...G10L-0015/28
...G10L-0015/00

Original Publication Data by Authority



Original Abstracts:

A word recognition system is disclosed for converting spoken
utterances into either text or commands. The system runs on a platform
capable of running a plurality of applications. Text and commands are sent
from a word recognition application to one or more user applications.
In addition, information pertaining to the state of the user
applications is sent back to the word recognition application. Word
recognition probabilities are modified based on the information received
from the user applications. As a result, the probabilities of recognizing a
spoken utterance as a particular command will be greater when that
command is active in the user application than when it...

...Also, text strings will be assigned higher probabilities when they are
appropriate for the present state of the user application.
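
A minimal sketch of the idea in this abstract, that commands currently active in the user application receive a higher recognition probability. The function name `reweight`, the multiplicative `boost` factor, and the renormalization step are assumptions made for illustration; the record does not say how the probabilities are actually altered.

```python
def reweight(scores, active_commands, boost=2.0):
    """Multiply the scores of currently active commands by `boost`, then renormalize.

    `scores` maps each candidate word or command to a recognition score.
    `active_commands` is the set reported back by the user application.
    """
    weighted = {
        word: score * (boost if word in active_commands else 1.0)
        for word, score in scores.items()
    }
    total = sum(weighted.values())
    return {word: score / total for word, score in weighted.items()}


# Hypothetical usage: "open file" is active in the application, "open mail" is not.
scores = {"open file": 0.4, "open mail": 0.6}
adjusted = reweight(scores, active_commands={"open file"})
```

With these example numbers the active command overtakes the inactive one after reweighting, which is the behavior the abstract describes.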
Claims:

A word recognition system designed to run as part of one program
on a computer system which is capable of running a plurality of computer
application programs at one time, said word recognition system
including: means for receiving user generated word signals representing
words to be recognized; recognition means for performing pattern
matching upon the word signals to select which one or more of a
plurality of vocabulary words most probably correspond to each such
word signal; other-program monitoring means for repeatedly initiating
the transfer to said word recognition system of time varying information
from one or more other application programs running on said computer;
and probability-altering means for repeatedly altering the probability of
which vocabulary words will be selected by said recognition means as
most probably corresponding to a received word signal as a function of
the time varying information received as a result of said transfer.



17/3,K/20 (Item 13 from file: 350)

DIALOG(R) File 350: Derwent WPIX
0008977603 - Drawing available
WPI ACC NO: 1998-531449/199845
XRPX Acc No: N1998-414727

Voice controlled personal computer interface system for military,
avionics application, robotics in plant - displays subroutines that aids
user for providing input based on input utterances
Patent Assignee: APPLIED VOICE RECOGNITION LP (VOIC-N)
Inventor: DOUGLAS H R
Patent Family (1 patents, 1 countries)
Patent                        Application
Number      Kind  Date        Number         Kind  Date      Update
US 5812977  A     19980922    US 1996696125  A     19960813  199845 B

Priority Applications (no., kind, date): US 1996696125 A 19960813

Patent Details

Number      Kind  Lan  Pg  Dwg  Filing Notes
US 5812977  A     EN   11  5

Voice controlled personal computer interface system for military,
avionics application, robotics in plant...

...displays subroutines that aids user for providing input based on input
utterances

Original Titles:

Voice control computer interface enabling implementation of common
subroutines.

Alerting Abstract ...The system uses a sequence of computer instructions
in order to perform several tasks. The user utters the sequence for
performing tasks through a microphone (20). The computer instruction
includes a help subroutine which aids the user for providing input based on
the utterances from the user. A conditioning circuit (20) outputs a
conditioned input signal from the input utterances.

...USE - Voice controlled weapons launch system Voice controlled
remotely piloted vehicle, speech recognition system

Class Codes

International Classification (Main): G10L-003/00
Original Publication Data by Authority



Original Abstracts:

...used to open various subroutines. The disclosed system which is
preferably operated by means of voice commands, therefore improves the
performance of the user so that the subroutines can be fetched more
readily, operated more effectively...
Claims:

A voice controlled computer interface system comprising: (a) a
sequence of computer instructions requiring input by a user in order to
perform one or more tasks; (b) a microphone into which said user utters
said sequence; (c) a help subroutine included within said computer
instructions which, upon activation by utterances from said user, aids
said user in providing said input; (d) a conditioning circuit for forming
a conditioned input signal from said utterances; (e) a stored vocabulary;
and (f) a recognition unit for correlating said conditioned input signal
with an element of said stored vocabulary thereby creating compatible
instructions recognizable by said computer.

0007864042 - Drawing available
WPI ACC NO: 1996-494600/199649
XRPX Acc No: N1996-417118

Audio control system for data processing system - outputs desirable
pronunciation of word by combining text data with notes, which is stored
in dictionary file

Patent Assignee: IBM CORP (IBMC); INT BUSINESS MACHINES CORP (IBMC)
Inventor: JOHNSON W; JOHNSON W J; OWEN; WEBER O
Patent Family (3 patents, 2 countries)
Patent                        Application
Number      Kind  Date        Number         Kind  Date      Update
JP 8255047  A     19961001    JP 199611677   A     19960126  199649 B
US 5787231  A     19980728    US 1995382737  A     19950202  199837 E
JP 3065924  B2    20000717    JP 199611677         19960126  200039 E

Priority Applications (no., kind, date): US 1995382737 A 19950202



Patent Details

Number      Kind  Lan  Pg  Dwg  Filing Notes
JP 8255047  A     JA   5
JP 3065924  B2    JA            Previously issued patent JP 08255047



...outputs desirable pronunciation of word by combining text data with
notes, which is stored in dictionary file

Original Titles:

METHOD AND SYSTEM FOR ANNOTATION OF VOICE CONTROL SYSTEM.

...Method and system for improving pronunciation in a voice control
system

Alerting Abstract ...This desirable pronunciation of the word is stored
in a dictionary file (44...

Title Terms.../Index Terms/Additional Words: WORD;

Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version
...G10L-0013/02...

...G10L-0013/04...

...G10L-0013/08
...G10L-0013/00

Original Publication Data by Authority



Original Abstracts:

A voice enunciation system and method provides a user with the
capability to sound out text files. As the files are audibly played, if the
user is not satisfied with the pronunciation of a particular word, the
system provides the user with the means of replacing the word with his
own particular pronunciation. The preferred pronunciation is also stored
in an override dictionary so that any subsequent encounter with that
particular word is pronounced correctly.
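
The override dictionary in this abstract amounts to a lookup that is consulted before the default pronunciation source. The sketch below assumes pronunciations are plain strings and that unknown words fall back to their spelling; the function `pronounce` and both dictionary names are hypothetical.

```python
def pronounce(text, default_lexicon, overrides):
    """Return one pronunciation per word, preferring the user's override entries."""
    result = []
    for word in text.lower().split():
        if word in overrides:
            result.append(overrides[word])                   # user-preferred pronunciation
        else:
            result.append(default_lexicon.get(word, word))   # fall back to the default
    return result


# Hypothetical usage: the user corrects "read" once, and later text uses the new entry.
default_lexicon = {"read": "reed"}
overrides = {}
before = pronounce("Read it", default_lexicon, overrides)
overrides["read"] = "red"
after = pronounce("Read it", default_lexicon, overrides)
```

Keeping the overrides separate from the default lexicon means a correction can be stored or discarded without touching the base data.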
Claims:

Claim 11. A method in a data processing system for enhancing voice
pronunciation of a textual input stream comprising the steps
of receiving text from the textual input stream, customizing a...

0006798545 - Drawing available
WPI ACC NO: 1994-185234/199423
XRPX Acc No: N1994-146268

Confusing phrase entry prevention in voice recognition system - using
hidden Markov modelling to compare repeated entries of additional phrase,
with controlled variation of allocated probabilities

Patent Assignee: TEXAS INSTR INC (TEXI)
Inventor: ITTYCHERIAH A P; WHEATLEY B J
Patent Family (6 patents, 7 countries)
Patent Application
Patent Details
Confusing phrase entry prevention in voice recognition system...

...using hidden Markov modelling to compare repeated entries of
additional phrase, with controlled variation of allocated probabilities

Original Titles:

...Method and system preventing entry of confusingly similar phrases in a
voice recognition system vocabulary list...

...Method and system for preventing entry of confusingly similar phrases
in a voice recognition system vocabulary list...

...Method and system for preventing entry of confusingly similar phases
in a voice recognition system vocabulary list

Alerting Abstract ...an additional phrase (60) for adding to the
vocabulary list (10) in a speaker-dependent voice recognition system.
Probabilities (62) are allocated to all phrases (12), with existing phrases
being assigned greater...

...phrase matches any already-existing phrase (12,14,16) in the modified
list. If so, entry of the new is inhibited...

...USE/ADVANTAGE - In voice recognition system e.g. for dialling
telephone numbers from database w.r.t. voice command, to prevent addn.
of confusingly similar commands to those on existing vocabulary list.

Equivalent Alerting Abstract ...1. A method for preventing the entry of
confusingly similar phrases in a vocabulary list of a speaker-dependent
voice recognition system comprising the steps of...
...phrase, said first probability and said second probability controlling
the likelihood that said speaker-dependent voice recognition system
will match said second-received phrase to said first-received phrase or
said plurality...

Class Codes

International Classification (Main): G10L-015/00 ...
... G10L-003/00

International Classification (+ Attributes)
IPC + Level Value Position Status Version
...G10L-0015/06...

...G10L-0015/10
...G10L-0015/00

Original Publication Data by Authority



Original Abstracts:

A method and system prevent the entry of confusingly similar phrases
(60) in a vocabulary list (10) of a speaker-dependent voice
recognition system. The method first receives (20, 30, 50) and enrolls
the phrase (60) for adding to vocabulary list (10). Next, probabilities
(62) are assigned to all phrases (12). Probabilities (62) assigned to...
...A method and system prevent the entry of confusingly similar phrases
(60) in a vocabulary list (10) of a speaker-dependent voice
recognition system. The method first receives (20, 30, 50) and
enrolls the phrase (60) for adding to vocabulary list (10). Next,
probabilities (62) are assigned to all phrases (12). Probabilities (62)
assigned to...
Claims:

1. A method for preventing the entry of confusingly similar phrases in
a vocabulary list of a speaker-dependent voice recognition system
comprising the steps of: first receiving a first-received phrase for
adding to a plurality of...

...phrase, said first probability and said second probability controlling
the likelihood that said speaker-dependent voice recognition system
will match said second-received phrase to said first-received phrase or
said plurality of other...

...1. A method for preventing the entry of already existing or
confusingly similar phrases in a vocabulary list (10) of a
speaker-dependent voice recognition system comprising the steps of:
first receiving a first-received phrase (60) pronounced by a
user for adding to a plurality of other phrases (12, 14, 16) on a
vocabulary list;
enrolling said phrase in said vocabulary list;
assigning a first probability to said phrase (60) and a second
probability to each...

...user, said first probability and said second probability controlling the
likelihood that said speaker-dependent voice recognition system will
match said second-received phrase to said first-received phrase (60) or one
of said plurality of other phrases (12, 14, 16) on said vocabulary list;
comparing said second-received phrase to said vocabulary
list including said first-received phrase (60) and said plurality of...

...A method for preventing the entry of confusingly similar phrases in a
vocabulary list of a speaker-dependent voice recognition system
comprising the steps of: first receiving a first-received phrase for adding
to a plurality of other phrases on a vocabulary list; enrolling said
first-received phrase in said vocabulary list; assigning a first
probability to said phrase and a second probability to each of said
plurality of other phrases, said second probability having a greater...

...phrase, said first probability and said second probability controlling
the likelihood that said speaker-dependent voice recognition system
will match said second-received phrase to said first-received phrase or
said plurality other phrases; indicating whether said second-received
phrase matches one of said plurality of other phrases; and...
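
The enrollment check in these claims can be illustrated with a toy stand-in. The record describes hidden Markov modelling of spoken phrases; the sketch below swaps in text similarity from Python's `difflib` purely to show the control flow, and the prior weights `new_prior` and `old_prior` are assumed values, not figures from the patent.

```python
from difflib import SequenceMatcher


def best_match(utterance, weighted_phrases):
    """Return the phrase with the highest similarity-times-weight score."""
    def score(phrase):
        return SequenceMatcher(None, utterance, phrase).ratio() * weighted_phrases[phrase]
    return max(weighted_phrases, key=score)


def try_enroll(first, second, vocabulary, new_prior=0.5, old_prior=1.0):
    """Enroll `first` unless the repeated utterance `second` matches an existing phrase."""
    candidates = {phrase: old_prior for phrase in vocabulary}
    candidates[first] = new_prior    # existing phrases carry the greater weight
    if best_match(second, candidates) != first:
        return False                 # confusingly similar to an existing entry: inhibit
    vocabulary.append(first)
    return True
```

Giving existing phrases the larger weight biases the comparison toward rejection, so a new phrase is accepted only when its repetition clearly matches itself rather than an older entry.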



17/3,K/23 (Item 16 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0005506674 - Drawing available
WPI ACC NO: 1991-109665/199115
XRPX Acc No: N1991-084511

Voice recognition system for security functions - has audio signal
converted into bytes representing time between audio signal zero crossings
Patent Assignee: SEARCY G (SEAR-I)
Inventor: KAVAN F; SEARCY G
Patent Family (1 patents, 1 countries)
Patent                        Application
Number      Kind  Date        Number         Kind  Date      Update
US 5003603  A     19910326    US 1984642299  A     19840820  199115 B

Priority Applications (no., kind, date): US 1984642299 A 19840820

Voice recognition system for security functions...

Original Titles:
Voice recognition system

Alerting Abstract ...intervals which are based on a computation of
substantially equal byte activity in all the words comprising the command
lexicon.

...In this manner, lower and higher frequency groups are selected for equal
significance. The uttered words are then compared against stored words
similarly transformed according to segment and frequency interval and if
the comparison conditions are satisfied...

...USE/ADVANTAGE - Also for voice-responsive remote control system.
Recognises spoken commands uttered by user and generates responsive
control signals once the command is recognised. (11pp Dwg.No. 1/7)

Class Codes
(Additional/Secondary): G10L-005/00

Original Publication Data by Authority



Original Abstracts:

A method and apparatus are disclosed for recognizing spoken commands
uttered by a user and for generating responsive control signals once
the command is recognized. In accordance with this disclosure...

...intervals which are based on a computation of substantially equal byte
activity in all the words comprising the command lexicon. In this
manner, lower and higher frequency groups are selected for equal
significance. The uttered words are then compared against stored words
similarly transformed according to segment and frequency interval and if
the comparison conditions are satisfied the...
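
The record title states that the audio signal is converted into bytes representing the time between zero crossings. A minimal sketch of that conversion follows, assuming time is counted in samples and each interval is clipped to fit one byte; the function name and the clipping rule are illustrative assumptions, and the segment and frequency grouping mentioned in the abstract is not modeled.

```python
def zero_crossing_intervals(samples, max_value=255):
    """Return the sample counts between successive sign changes, one byte each."""
    intervals = []
    last = None                                        # index of the previous crossing
    for i in range(1, len(samples)):
        if (samples[i - 1] < 0) != (samples[i] < 0):   # sign change between neighbours
            if last is not None:
                intervals.append(min(i - last, max_value))
            last = i
    return bytes(intervals)
```

Short intervals correspond to higher-frequency content and long intervals to lower-frequency content, which is what makes a per-interval byte stream usable as a coarse spectral description.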



17/3,K/24 (Item 17 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0005211116 - Drawing available
WPI ACC NO: 1990-202891/199027
XRPX Acc No: N1990-157892

Speech recognition system - displays word menu to user for acceptance
or prior editing before association with acoustic model
Patent Assignee: DRAGON SYST INC (DRAG-N); DRAGON SYSTEMS INC (DRAG-N)
Inventor: BAKER J K; PORTER E W; ROBERTS J
Patent Family (5 patents, 13 countries)
Patent                         Application
Number       Kind  Date        Number         Kind  Date      Update
EP 376501    A     19900704    EP 1989312521        19891130  199027
ES 2018761   A     19910516                                   199125
US 5027406   A     19910625    US 1988280700        19881206  199128
EP 376501    B1    19970604    EP 1989312521        19891130  199727
DE 68928097  E     19970710    DE 68928097          19891130  199733
                               EP 1989312521        19891130

Priority Applications (no., kind, date): US 1988280700 A 19881206

Patent Details

Number       Kind  Lan  Pg  Dwg  Filing Notes
EP 376501    A     EN
Regional Designated States, Original: AT BE CH DE ES FR GB GR IT LI LU NL
   SE
EP 376501    B1    EN   47  36
Regional Designated States, Original: AT BE CH DE ES FR GB GR IT LI LU NL
   SE
DE 68928097  E     DE            Application EP 1989312521
                                 Based on OPI patent EP 376501



Speech recognition system...

...displays word menu to user for acceptance or prior editing before
association with acoustic model

Original Titles:

...Speech recognition system...
...Speech recognition system...

...Method for interactive speech recognition and training

Alerting Abstract ...The speech recognition system provides a method
of creating word models. An acoustic model is made from one or more
utterances of a word or group of words spoken as a single utterance.
A group of textual characters is associated with this utterance and
displayed to the user. If the word is not correctly recognised a menu of
such groups may be displayed and the user...

...The system saves tokens only for utterances when the user specifically
confirms the group of characters to be associated with that utterance.
All the tokens associated with a word are stored and when a confirmation
is received a new acoustic model is built using...

Equivalent Alerting Abstract ...As the user is dictating, the user
speaks a word which may or may not already be in the active vocabulary.
The system displays a list of the words in the active vocabulary
which best match the spoken word. By keyboard or voice command, the
user may choose the correct word from the list or may choose to edit a
similar word if the correct word is not on the list. Alternately, the
user may type or speak the initial letters of the word. Then the
recognition algorithm is called again satisfying the initial letters, and
the choices displayed again. A word list is then also displayed from a
large backup vocabulary. The best words to display from the backup
vocabulary are chosen using a statistical language model and optionally
word models derived from a phoneme dictionary. When the correct word
is chosen by the user, the speech sample is used to create or update an
acoustic model for the word, without further intervention by the user. As
the system is used, it also constantly updates...

...USE/ADVANTAGE - Creating word models for large vocabulary, natural
language dictation system. User with limited typing skills can create
documents with little or no advance training of word models...

Title Terms.../Index Terms/Additional Words: WORD;

Class Codes

International Classification (Main): G10L-005/06
International Classification (+ Attributes)
IPC + Level Value Position Status Version
G10L-0015/06...

...G10L-0015/22
G10L-0015/00...

Original Publication Data by Authority



Original Abstracts:

A system for creating word models comprising means for making an acoustic
model from one or more utterances of word, means for enabling a user to
associate a sequence of textual characters with that acoustic...

...A method for creating word models for a large vocabulary, natural
language dictation system. A user with limited typing skills can create
documents with little or no advance training of word models. As the user
is dictating, the user speaks a word which may or may not already be
in the active vocabulary. The system displays a list of the words in
the active vocabulary which best match the spoken word. By keyboard or
voice command, the user may choose the correct word from the list or may
choose to edit a similar word if the correct word is not on the list.
Alternately, the user may type or speak the initial letters of the
word. Then the recognition algorithm is called again satisfying the
initial letters, and the choices displayed again. A word list is then
also displayed from a large backup vocabulary. The best words to
display from the backup vocabulary are chosen using a statistical
language model and optionally word models derived from a phonemic
dictionary. When the correct word is chosen by the user, the speech
sample is used to create or update an acoustic model for the word,
without further intervention by the user. As the system is used, it also
constantly updates its statistical language model. The system gets more and
more word models and keeps improving its performance the more it is used.
The system may be used for connected speech as well as for discrete
utterances.
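
The re-run of recognition "satisfying the initial letters" can be pictured as a prefix filter over two ranked word lists. The sketch below assumes each word already has a match score; the function `candidates`, the `limit` parameter, and the score dictionary are hypothetical, and the statistical language model mentioned in the abstract is not modeled.

```python
def candidates(scores, active, backup, prefix="", limit=5):
    """Return (active choices, backup choices) restricted to words starting with `prefix`."""
    def ranked(words):
        hits = [word for word in words if word.startswith(prefix)]
        hits.sort(key=lambda word: scores.get(word, 0.0), reverse=True)
        return hits[:limit]
    return ranked(active), ranked(backup)


# Hypothetical usage: the user types "the" after the first menu lacked the right word.
scores = {"there": 0.9, "their": 0.8, "three": 0.3, "therapy": 0.2}
menu = candidates(scores, ["there", "three"], ["their", "therapy"], prefix="the")
```

Returning the two lists separately keeps the active-vocabulary choices distinct from the backup-vocabulary choices, as in the two displayed lists the abstract describes.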
Claims:

The speech recognition system provides a method of creating word
models. An acoustic model is made from one or more utterances of a word
or group of words spoken as a single utterance. A group of textual
characters is associated with this utterance and displayed to the user.
If the word is not correctly recognised a menu of such groups may be
displayed and the user...

...The system saves tokens only for utterances when the user specifically
confirms the group of characters to be associated with that utterance.
All the tokens associated with a word are stored and when a confirmation
is received a new acoustic model is built using...

...1. A speech recognition system for performing speech recognition
on an uttered word comprising
- means (124) for storing a set of
acoustic word models (132), each comprising acoustic data, with each such
acoustic word model (132) being stored in association with a word label
(504),
- means (112-118) for forming an acoustic description of an
uttered word to be recognised,
- recognition means (129) for
comparing the acoustic description thus formed against a plurality of
stored acoustic word models (132) stored in said means for storing a set
of acoustic word models (124-128) and for selecting an acoustic word
model (132) which best matches the acoustic description, characterised in
that means (208-214) are...

...data from the acoustic description with the acoustic data stored for the
best matching acoustic word model (132) thereby to update (180) the
acoustic word model (132), and means (216) are provided for storing the
updated acoustic word model (132) in said set of stored acoustic word
models such that the updated acoustic word model (132) is used in the
subsequent performance of speech recognition utilising said system



19/3,K/1 (Item 1 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0012906288 - Drawing available
WPI ACC NO: 2002-229223/200229
XRPX Acc No: N2002-176198

Speech recognition system for mobile communication devices is capable
of rapidly processing greater varieties of words and operable in many
different devices

Patent Assignee: VERBALTEK CO LTD (VERB-N); VERBALTEK INC (VERB-N)
Inventor: CHANG J; CHEN J; CHEN J Y; KIM Y; PAN J
Priority Applications (no., kind, date): US 2000538657 A 20000330
Patent Details

Number      Kind  Lan  Pg  Dwg  Filing Notes
EP 1139332  A2    EN   19  10
Regional Designated States, Original: AL AT BE CH CY DE DK ES FI FR GB GR
   IE IT LI LT LU LV MC MK NL PT RO SE SI TR
EP 1139332  A9    EN
Regional Designated States, Original: AL AT BE CH CY DE DK ES FI FR GB GR
   IE IT LI LT LU LV MC MK NL PT RO SE SI TR
TW 504663   A     ZH

Speech recognition system for mobile communication devices is capable
of rapidly processing greater varieties of words and...

Original Titles:

...Spelling speech recognition apparatus...

...SYLLABIC SPEECH RECOGNITION SYSTEM AND METHOD FOR MOBILE
COMMUNICATION...

...Spelling speech recognition apparatus and method for communications.

Alerting Abstract ...NOVELTY - A front-end signal processor is used for
generating parametric representation of speech input signals. The
parametric signals are compared with the parametric representations of
letter pronunciations. Then a sequence of associations between the input
speech, the letters in a pronunciation database (103) and a vocabulary
database (203). These selected letters and words are displayed for
confirmation... are also included for a letter similarity comparator, an
electronic communication device, a method of recognizing speech sound
signals...

...DESCRIPTION OF DRAWINGS - The drawing shows a block diagram of the
speech recognition system according to the present invention...

Class Codes

International Classification (Main): G10L-015/00
International Classification (+ Attributes)
IPC + Level Value Position Status Version
G10L-0015/02...

G10L-0015/10...

G10L-0015/18...

G10L-0015/22
G10L-0015/00...

Original Publication Data by Authority



Cr i gi nal Abst ract s: 
An accurate speech recognition syst em capabl e of rapidly 
processing greater varieties of words and operable in many different 
devices, but without the computational power and memory requirements, high 
power consurrpt i on, corrpl ex operating system, high costs, and weight 
of traditional systems. The utilization of individual letter utterances to 
transmit words allows voice information... 

...phones, PDAs, and other communication devices. This invention is an
apparatus and method for a speech recognition system comprising a
microphone, front-end signal processor for generating parametric
representations of speech input signals, a pronunciation database,
a letter similarity comparator for comparing the parametric representation
of the input signals with the parametric representations of letter
pronunciations, and generating a sequence of associations between the
input speech and the letters in the pronunciation database, a
vocabulary database, a word similarity comparator for comparing an
aggregated plurality of the letters with the words in the vocabulary
database and generating a sequence of associations between them and a
display for displaying the selected letters and words for confirmation.
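
The two-stage comparison described in this abstract (letters first, then
words) can be illustrated with a minimal Python sketch. Everything in it is
an assumption made for illustration and is not taken from the patent: the
2-D feature vectors, the names PRONUNCIATION_DB and VOCABULARY_DB, the
Euclidean distance used as the letter similarity measure, and the difflib
ratio used as the word similarity measure.

```python
import math
from difflib import SequenceMatcher

# Hypothetical toy "pronunciation database": one feature vector per letter.
PRONUNCIATION_DB = {
    "c": (0.0, 1.0),
    "a": (1.0, 0.0),
    "t": (1.0, 1.0),
    "r": (0.0, 0.0),
}

# Hypothetical toy "vocabulary database".
VOCABULARY_DB = ["cat", "car", "art"]


def closest_letter(features):
    """Letter similarity step: pick the letter whose stored vector is nearest."""
    return min(PRONUNCIATION_DB,
               key=lambda letter: math.dist(features, PRONUNCIATION_DB[letter]))


def closest_word(letters):
    """Word similarity step: pick the vocabulary word most similar to the letters."""
    return max(VOCABULARY_DB,
               key=lambda word: SequenceMatcher(None, letters, word).ratio())


def recognize(utterances):
    """Map each spoken-letter vector to a letter, then the letter string to a word."""
    letters = "".join(closest_letter(f) for f in utterances)
    return letters, closest_word(letters)


if __name__ == "__main__":
    # Three noisy letter utterances; both results would then be shown for confirmation.
    print(recognize([(0.1, 0.9), (0.9, 0.1), (0.9, 0.8)]))
```

A real system would replace both similarity measures with acoustic scoring;
the sketch only shows how the letter stage feeds the word stage.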



...An accurate speech recognition system capable of rapidly
processing greater varieties of words and operable in many different

10501857 Language Recognizer And Operating Method Therefor

devices, but without the computational power and memory requirements, high
power consumption, complex operating system, high costs, and weight
of traditional systems. The utilization of individual letter utterances to
transmit words allows voice information transfer...

...phones, PDAs, and other communication devices. This invention is an
apparatus and method for a speech recognition system comprising a
microphone, front-end signal processor for generating parametric
representations of speech input signals, a pronunciation database,
a letter similarity comparator for comparing the parametric representation
of the input signals with the parametric representations of letter
pronunciations, and generating a sequence of associations between the
input speech and the letters in the pronunciation database, a
vocabulary database, a word similarity comparator for comparing an
aggregated plurality of the letters with the words in the vocabulary
database and generating a sequence of associations between them and a
display for displaying the selected letters and words for confirmation.
Claims:

A speech recognition system comprising: microphone means for
receiving acoustic waves and converting the acoustic waves into electronic
signals; front...

...electronic signals and said plurality of parametric representations of 
letter pronunciations responsive to predetermined criteria; vocabulary 
database storage means for storing a plurality of parametric
representations of word pronunciations; word similarity comparator
means, coupled to said letter similarity comparator and to said
vocabulary database storage means, for comparing an aggregated plurality
of parametric representations of letter pronunciations with said
plurality of parametric representations of word pronunciations, and
generating a second sequence of associations between at least one of said
aggregated plurality of parametric representations of the letter
pronunciations with at least one of said...

...A speech recognition system comprising: microphone means for
receiving acoustic waves and converting the acoustic waves into electronic
signals; front-end signal processing means, coupled to said
microphone means, for processing the electronic signals to generate
parametric representations of the...

...electronic signals and said plurality of parametric representations of 
letter pronunciations responsive to predetermined criteria; vocabulary 
database storage means for storing a plurality of parametric
representations of word pronunciations; word similarity comparator
means, coupled to said letter similarity comparator and to said vocabulary
database storage means, for comparing an aggregated plurality of
parametric representations of letter pronunciations with said plurality
of parametric representations of word pronunciations, and generating a
second sequence of associations between at least one of said aggregated
plurality of parametric representations of the letter pronunciations with
at least one of said plurality of parametric representations of word
pronunciations responsive to predetermined criteria; and display means,
coupled to said word similarity comparator means...



19/3,K/2 (Item 2 from file: 350)
0009806402 - Drawing available
WPI ACC NO: 2000-096022/200008
XRPX Acc No: N2000-074109

Computer program in reading machine with recursive dictionary and talking
help menu for persons with visual impairment
Patent Assignee: KURZWEIL EDUCATIONAL SYSTEMS INC (KURZ-N)
Inventor: DIONNE M S; QGFM\N M T
Patent Family (1 patents, 1 countries)
Patent                      Application
Number      Kind  Date      Number          Kind  Date      Update
US 5999903  A     19991207  US 1997883723   A     19970627  200008 B

Priority Applications (no., kind, date): US 1997883723 A 19970627

Patent Details
Number      Kind  Lan  Pg  Dwg  Filing Notes
US 5999903  A     EN   24  12

Alerting Abstract ...NOVELTY - A text file is produced for selected word,
that comprises data assisting user. An operating system routine is
called to command the operating system for producing a new window on a
display device. The window is filled with data...

...ADVANTAGE - Provides enrichment feature such as spelling, dictionary
in recursive manner by calling word by word as per user desire, thus
enabling user to seek and read definition of word while the word is
spelled, defined and translated...

Class Codes

... (Additional/Secondary): G10L-009/06
Original Publication Data by Authority

Original Abstracts:

A reading machine includes computer program residing on a computer
readable medium operable with an operating system that permits the
production of windows on a display. The program has instructions for
causing a computer to provide...

...file for the first selected word comprised of data that assists the user
call. An operating system routine commands the operating system
to create a new window on the display and fill the new window on the
display with the information...
Claims:

... display for displaying text or an image representation of text; said
computer program operable with an operating system that permits the
production of windows on a display, said computer program enabling the
reading machine to synthesize speech from a first text file
corresponding to the text or image representation of a document
displayed on the display, said computer program comprising instructions...

... a synonym or a foreign translation of a word from the first text file;
call an operating system routine to command the operating system to
create a new window on the display; fill the new window on the
display with the data contained in the second text file; provide a second
call to a...



19/3,K/3 (Item 3 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0009666575 - Drawing available
WPI ACC NO: 1999-620103/199953
XRPX Acc No: N1999-457378

Processor controlled application program operating system with voice
recognition

Patent Assignee: MOTOROLA INC (MOTI)
Inventor: BALAKRISHNAN S

CN 1109328
20030521

CN 1999800423
200541

Priority Applications (no., kind, date): US 199853432 A 19980401
Patent Details

Number Kind Lan Pg Dwg Filing Notes

WO 1999050831 A1 EN 22 5

National Designated States, Original: AL AM AT AU AZ BA BB BG BR BY CA CH
CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR
KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI
SK SL TJ TM TR TT UA UG UZ VN YU ZW
Regional Designated States, Original: AT BE CH CY DE DK EA ES FI FR GB GH
GM GR IE IT KE LS LU MC MW NL OA PT SD SE SL SZ UG ZW

AU 199930805 A EN Based on OPI patent WO 1999050831

EP 986809 A1 EN PCT Application WO 1999US5339
Based on OPI patent WO 1999050831
Regional Designated States, Original: DE FI FR GB NL SE

EP 986809 B1 EN PCT Application WO 1999US5339
Based on OPI patent WO 1999050831
Regional Designated States, Original: DE FI FR GB NL SE

DE 69910928 E DE Application EP 1999912430
PCT Application WO 1999US5339
Based on OPI patent EP 986809
Based on OPI patent WO 1999050831

Processor controlled application program operating system with voice 
recognition

Original Titles:

...COMPUTER OPERATING SYSTEM WITH VOICE RECOGNITION

...SPEECH RECOGNITION METHOD WITH MULTIPLE APPLICATION PROGRAMS...
...COMPUTER OPERATING SYSTEM WITH VOICE RECOGNITION

Alerting Abstract ...NOVELTY - An application (32) is under the control
of operating system (30) and is modularly separated from the system
(30). The application contains a voice recognizer (36) for recognizing
voice commands which are presented to an arbitrator (70) for recognizing
information indicative of the voice recognizer's ability to recognize
the voice commands. DESCRIPTION - The application comprises a dictionary
and language model. The recognition information is dependent on the
dictionary and the language model. An INDEPENDENT CLAIM is also...
...USE - For indicating the operating system whether given speech
input is useful to the individual applications...

...ADVANTAGE - Application is better situated than the operating system
for determining as whether certain speech is relevant to the application.
The direction of speech to the operating system or to a particular
application can be better decided by the application...
... 30 Operating system

... 36 Voice recognizer

Class Codes

International Classification (Main): G10L-015/26
International Classification (+ Attributes)
IPC + Level Value Position Status Version

... G10L-0015/26

... G10L-0015/00

Original Publication Data by Authority

Original Abstracts:

A computer operating system having a voice recognition tool (36)
and first and second application programs (32 and 34). The operating
system (30) receives information from the first and second application
programs relating to control words for controlling the application
programs. This information is illustrated as vocabulary A (74) and
vocabulary B (76). The voice recognition tool (36) is arranged to
accept the information from the first and second application programs and
perform a comparison with digitized voice received at an input (60). A
selected one of the application programs (32 and 34) is controlled
according to...

...A computer operating system (30) receives information from

controlling the application programs. This information is contained in
vocabulary A (74) and vocabulary B (76). The speech
recognition tool (36) is arranged to accept the information from
the first and second application programs and perform a comparison with
digitized speech received at an input...

...A computer operating system having a voice recognition tool (36)
and first and second application programs (32 and 34). The operating
system (30) receives information from the first and second
application programs relating to control words for controlling the
application programs. This information is illustrated as vocabulary
A (74) and vocabulary B (76). The voice recognition tool (36) is
arranged to accept the information from the first and second application
programs and perform a comparison with digitized voice received at an
input (60). A selected one of the application programs (32 and 34) is...
Claims:

...A method of operation of a computer having an operating system and
at least two application programs, comprising: entering voice
commands; comparing the voice commands entered with representations of the
voice commands associated with each application program to provide a
recognition result corresponding to each application program providing
the recognition result of each application program to the operating
system; identifying an application program that is in the foreground and at
least one application program that is in the background based on the
recognition results; and characterised by selectively permitting
a particular application program of the at least two application programs



...A processor comprising: an input for receiving spoken commands; an
operating system having an arbitrator; one or more applications each
having a search applet, the one or more applications under the control of
the operating system and modularly separate from the operating
system; one or more vocabularies that may be called by the search applet,

first and second

words for

each of the one or more vocabularies having commands pertinent to one of
each said applications; and one or more language models that may be
called by the search applet, each of the one or more language models having
language constraints pertinent to one of each said applications; the
operating system comprising a speech recognizer for recognizing
spoken commands and, based on the data found in the vocabularies and
language models, obtaining...

...one or more applications to automatically present to the arbitrator 
recognition information indicative of the voice recognizer's
ability to recognize the spoken commands, the arbitrator determining
based on the confidence level which of the at least one or more
applications is to respond to the spoken commands and permitting the
determined application to act on the spoken command and instructing...

...the arbitrator further adapted to direct commands to one of the one or 
more applications based on whether the one application is in the 
foreground or in the background. 
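
The arbitration described in these claims (each application reports how
well it recognized the command, and the arbitrator decides which one may
act) can be sketched in Python as follows. This is only an illustrative
sketch under stated assumptions: the RecognitionResult fields, the 0.0 to
1.0 confidence scale, the threshold value, and the rule that a foreground
application wins a tie are hypothetical and are not specified in the
record.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class RecognitionResult:
    """What one application reports to the arbitrator (hypothetical fields)."""
    app_name: str
    confidence: float      # assumed scale: 0.0 (no match) to 1.0 (certain)
    in_foreground: bool


def arbitrate(results: Iterable[RecognitionResult],
              threshold: float = 0.5) -> Optional[str]:
    """Return the name of the application permitted to act on the command.

    Applications below the threshold are ignored. Among the rest, the highest
    confidence wins; on equal confidence the foreground application is preferred.
    """
    candidates = [r for r in results if r.confidence >= threshold]
    if not candidates:
        return None  # no application recognized the command well enough
    best = max(candidates, key=lambda r: (r.confidence, r.in_foreground))
    return best.app_name


if __name__ == "__main__":
    reports = [
        RecognitionResult("mail", 0.9, in_foreground=False),
        RecognitionResult("editor", 0.6, in_foreground=True),
    ]
    print(arbitrate(reports))
```

Sorting on the (confidence, in_foreground) pair keeps the decision in one
place, so a different policy only needs a different key function.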



19/3,K/4 (Item 4 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0009463474 - Drawing available
WPI ACC NO: 1999-403770/199934
Related WPI Acc No: 1995-240307; 1999-070054; 1999-404055; 1999-404056;
1999-456293; 1999-600343; 2000-052130; 2000-586103; 2000-636971;
2000-637172
XRPX Acc No: N1999-300862

Computerized speech recognition method for use in multitasking
operation system

Patent Assignee: DRAGON SYSTEMS INC (DRAG-N)
Inventor: GOULD J M; MCGRATH F J; PARKE J W; ROBERTS J M; SQUIRES S D
Patent Family (1 patents, 1 countries)
Patent                      Application
Number      Kind  Date      Number          Kind  Date      Update
US 5909666  A     19990601  US 1992976413   A     19921113  199934 B
                            US 1995382752   A     19950201
                            US 1997883293   A     19970626

Priority Applications (no., kind, date): US 1992976413 A 19921113; US
1995382752 A 19950201; US 1997883293 A 19970626

Patent Details
Number      Kind  Lan  Filing Notes
US 5909666  A     EN   C-I-P of application US 1992976413
                       Division of application US 1995382752
                       C-I-P of patent US 5428707

Computerized speech recognition method for use in multitasking
operation system

Original Titles:

Speech recognition system which creates acoustic models by
concatenating acoustic models of individual words.

Alerting Abstract ...NOVELTY - An acoustic word model and spelling for
several vocabulary words are stored. A new vocabulary word is
created from spelling of a phrase of two or more words. A new acoustic
model is created for new vocabulary word by concatenating acoustic
models of individual words of the phrase. DESCRIPTION - Acoustic word
signals representing sound of spoken words are pattern matched against
stored vocabulary words, to produce scores indicating relative
probability that given word signal corresponds to different stored
models. One vocabulary word is selected as the most probable one
corresponding to given word signal as a function of scores produced by
pattern matching unit...

...USE - Use in preemptive multitasking operating system such as
Microsoft NT (RTM), UNIX (RTM) system...

...ADVANTAGE - When used to train speech recognition system user is
able to control system by voice even when training by acoustic models.
Probability of recognition is increased even when character recognized by
vocabulary is not represented in signal. Alphabetic filtering is used to
limit recognition without considering match of entered word with letter
in desired word.



...DESCRIPTION OF DRAWINGS - The figure shows schematic representation of
computer system in which speech recognition is carried out.

Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version

... G10L-0015/06 ...
... G10L-0015/18 ...

... G10L-0015/22 ...

... G10L-0015/26 ...

... G10L-0015/28
... G10L-0015/00

Original Publication Data by Authority

Original Abstracts:

A computerized speech recognition system creates acoustic models of
phrases by concatenating acoustic models for individual words. The system
stores an acoustic word model and spelling for each of its vocabulary words

...created by concatenating the acoustic word models of previous vocabulary
words whose spellings correspond to words in the multi-word spelling as
the acoustic model for the new word. The system can then perform
speech recognition by comparing acoustic signals against the word models
of stored vocabulary words, including those representing such
multi-word phrases. Preferably when a multi-word model is formed,
the individual acoustic models concatenated are modified to represent the
coarticulation which takes place between words spoken continuously. This
can be done by representing word models as sequences of individual phonemes

...a user or be obtained from other programs running on the same computer
as the speech recognizer, such as from one or more commands
available in another program
Claims:

A computerized method of performing speech recognition comprising
the steps of: storing, for each of a plurality of vocabulary words, an
acoustic word model and a spelling; receiving the spelling of a phrase of...

...pattern matching the word signals against the word models of stored
vocabulary words, including said new vocabulary word, to produce
scores indicating the relative probability that a given word signal
corresponds to different ones of said stored word models;
and selecting one or more of said vocabulary words as most probably
corresponding to a given word signal as a function of said scores
produced by said pattern matching.
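
The step of building a phrase model by concatenating stored word models can
be sketched in Python as follows. The sketch assumes, as the abstract
suggests is one option, that each word model is a sequence of phonemes. The
phoneme symbols, the WORD_MODELS table and the add_phrase helper are
hypothetical, and the coarticulation adjustment mentioned in the abstract
is deliberately left out.

```python
# Hypothetical stored vocabulary: spelling -> acoustic model as a phoneme list.
WORD_MODELS = {
    "open": ["OW", "P", "AH", "N"],
    "file": ["F", "AY", "L"],
}


def add_phrase(vocabulary, spelling):
    """Create a model for a multi-word phrase from the models of its words.

    The phrase spelling is split into words, each word's stored model is
    looked up, and the models are concatenated in order. The result is stored
    under the phrase spelling so it can be matched like any other entry.
    """
    words = spelling.lower().split()
    missing = [w for w in words if w not in vocabulary]
    if missing:
        raise KeyError(f"no stored model for: {missing}")
    phrase_model = [phoneme for w in words for phoneme in vocabulary[w]]
    vocabulary[" ".join(words)] = phrase_model
    return phrase_model


if __name__ == "__main__":
    vocab = dict(WORD_MODELS)
    print(add_phrase(vocab, "Open File"))
```

Because the phrase is stored as an ordinary vocabulary entry, the same
pattern matching step can score it alongside single words.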



19/3,K/5 (Item 5 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0009454304 - Drawing available
WPI ACC NO: 1999-393933/199933
XRPX Acc No: N1999-294358

Voice activated text correction method for computer voice recognition
system

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC)
Inventor: COE K; LEONTIADES K
Patent Family (1 patents, 1 countries)
Patent                      Application
Number      Kind  Date      Number          Kind  Date      Update
US 5909667  A     19990601  US 1997812387   A     19970305  199933 B

Priority Applications (no., kind, date): US 1997812387 A 19970305

Patent Details
Number      Kind  Lan  Pg  Dwg  Filing Notes
US 5909667  A     EN   20  9

Voice activated text correction method for computer voice recognition
system

Alerting Abstract ...NOVELTY - An error word spoken by a user is received
and identified by a speech recognizing application (26) and a speech
text processor application (28). The identified word is matched with a
suitably displayed word. One or more...

...DESCRIPTION OF DRAWINGS - The figure shows block diagram of the computer
voice recognition system...

... 26 Speech recognition engine application...
... 28 Speech text processor application
Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version

... G10L-0015/22

... G10L-0015/00

Original Publication Data by Authority

Original Abstracts:

...comprised of "m" words and the value of m is preferably determined,
based upon the voice recognition capabilities of a particular
voice recognition engine associated with the dictation system. The
greater the ability of the speech recognition engine to recognize
and distinguish spoken words, the larger the value of m. A correction
vocabulary is defined which consists of the plurality of words which
are contained within the correction window and a plurality of correction
commands. The voice recognition computer dictation system is
configured for processing a spoken utterance which corresponds to an
error word contained within the plurality of words highlighted in the...
Claims:

In a computer system (20) having an operating system (24), a memory
(27), and processing means (26, 28) for receiving spoken words and
converting said spoken.



19/3,K/6 (Item 6 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0008632386 - Drawing available
WPI ACC NO: 1998-169400/199815
XRPX Acc No: N1998-134450
Speech recognition manager for processing system - has speech
processor using syntax file relating to defined context and changing file
for context changing input, using processing system operating in accordance
with one or more application programs responsive to input commands
Patent Assignee: VIRTUAL VISION INC (VIRT-N)
Inventor: CATALLO L R; MALLEY J A
Patent Family (3 patents, 21 countries)



Patent                          Application
Number          Kind  Date      Number           Kind  Date      Update
WO 1998008215   A1    19980226  WO 1997US13021   A     19970724  199815
US 5867817      A     19990202  US 1996699632    A     19960819  199912
EP 979505       A1    20000216  EP 1997936202    A     19970724  200014
                                WO 1997US13021   A     19970724

Priority Applications (no., kind, date): US 1996699632 A 19960819
Patent Details

Number Kind Lan Pg Dwg Filing Notes

WO 1998008215 A1 EN 58 12

National Designated States, Original: CA JP

Regional Designated States, Original: AT BE CH DE DK ES FI FR GB GR IE IT
LU MC NL PT SE

EP 979505 A1 EN PCT Application WO 1997US13021
Based on OPI patent WO 1998008215
Regional Designated States, Original: AT BE CH DE DK ES FI FR GB GR IE IT
LI LU MC NL PT SE

Speech recognition manager for processing system...

...has speech processor using syntax file relating to defined context
and changing file for context changing input, using...

Original Titles:

...SPEECH RECOGNITION MANAGER...

...Speech recognition manager...

...SPEECH RECOGNITION MANAGER

Alerting Abstract ...The speech recognition manager has a memory for
storing data for several context states. The manager also has...

...ADVANTAGE - Allows words to have multiple meanings and improves
detection by limiting dictionary to current context.

Class Codes

International Classification (+ Attributes)
IPC + Level Value Position Status Version
G10L-0015/22 ...

... G10L-0015/26
G10L-0015/00 ...

Original Publication Data by Authority
Original Abstracts:
A speech recognition manager receives representations of one or
more words from a speech decoding system (106) and interprets the received
words...

...upon the current context state so as to provide extremely accurate, 
flexible, extendable and scalable speech recognition and 
interpretation. The speech recognition manager limits the number
of words that the speech decoding system (106) can recognize in a given
context state in order to increase the speed and accuracy of the speech
recognition process. Whenever the context state changes, the
manager loads a new list of words that can be recognized for the...

...into the speech decoding system (106) so that while the speed and
accuracy of the speech recognition process is increased, the
total grammatical structure recognized can be easily increased as well...

...A speech recognition manager receives representations of one or more
words from a speech decoding system and interprets the received words
based upon the current context state so as to provide extremely accurate,
flexible, extendable and scalable speech recognition and
interpretation. The speech recognition manager limits the number of
words that the speech decoding system can recognize in a given context
state in order to increase the speed and accuracy of the speech
recognition process. Whenever the context state changes, the manager
loads a new list of words that can be recognized for the new context
state into the speech decoding system so that while the speed and
accuracy of the speech recognition process is increased, the total
grammatical structure recognized can be easily increased as well...

...A speech recognition manager receives representations of one or more
words from a speech decoding system (106) and interprets the received
words based upon the current context state so as to provide extremely
accurate, flexible, extendable and scalable speech recognition and
interpretation. The speech recognition manager limits the number of
words that the speech decoding system (106) can recognize in a
given context state in order to increase the speed and accuracy of the
speech recognition process. Whenever the context state changes, the
manager loads a new list of words that can be recognized for the new
context state into the speech decoding system (106) so that while the speed
and accuracy of the speech recognition process is increased, the
total grammatical structure recognized can be easily increased as well.
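
The context-state mechanism in this abstract (a small active word list per
context, swapped whenever the context changes) can be sketched in Python as
follows. The class name, the example contexts and word lists, and the
accept method standing in for the speech decoding system are all
hypothetical and serve only to illustrate the idea.

```python
class SpeechRecognitionManager:
    """Keeps one word list per context state and activates only the current one."""

    def __init__(self, word_lists, initial_state):
        # word_lists: mapping of context state name -> iterable of allowed words
        self.word_lists = {state: set(words) for state, words in word_lists.items()}
        self.state = None
        self.active_words = set()
        self.change_context(initial_state)

    def change_context(self, new_state):
        """Load the word list for the new context state into the 'decoder'."""
        if new_state not in self.word_lists:
            raise KeyError(f"unknown context state: {new_state}")
        self.state = new_state
        self.active_words = self.word_lists[new_state]

    def accept(self, word):
        """Stand-in for decoding: a word is recognizable only in the active list."""
        return word.lower() in self.active_words


if __name__ == "__main__":
    manager = SpeechRecognitionManager(
        {"main_menu": ["mail", "calendar"], "mail": ["read", "reply", "back"]},
        initial_state="main_menu",
    )
    print(manager.accept("reply"))   # not recognizable in the main menu context
    manager.change_context("mail")
    print(manager.accept("reply"))   # recognizable after the context change
```

Each context keeps its own small list, so adding a context enlarges the
total recognizable vocabulary without enlarging any single active list.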

Claims:

A speech recognition manager for a processing system operating 
in accordance with one or more application programs responsive to input 
commands, said manager receiving representations... 



19/3,K/7 (Item 7 from file: 350)

DIALOG(R) File 350: Derwent WPIX

(c) 2008 The Thomson Corporation. All rts. reserv.

0007538844 - Drawing available
WPI ACC NO: 1996-152902/199616
XRPX Acc No: N1996-128436

Speech-activated information system - has word graph obtained by evaluation
of individual words converted into concept graph evaluated for addressing
response word data bank

Patent Assignee: KONINK PHILIPS ELECTRONICS NV (PHIG); PHILIPS CORP
INTELLECTUAL PROPERTY GMBH (PHIG); PHILIPS ELECTRONICS NV (PHIG);
PHILIPS GLOEILAMPENFAB NV (PHIG); PHILIPS PATENTVERWALTUNG GMBH (PHIG);

KR 199529601
A
19950912
200336

Priority Applications (no., kind, date): DE 4432632 A 19940914; EP
1995202394 A 19950905

Filing Notes

Patent Details

AT BE CH DE FR GB IT LI NL
Application EP 1995202394
Based on OPI patent EP 702353
Previously issued patent KR 96011836



Original Titles:

...System and method for outputting synthetic speech in response to
input speech signals...

...System and method for outputting synthetic speech in response to
input speech signals...

...System and method for outputting spoken information in response to
input speech signals.

Class Codes

...International Classification (Main): G10L-011/00 ...
... G10L-015/18

International Classification (+ Attributes)
IPC + Level Value Position Status Version
... G10L-0013/00 ...

... G10L-0015/18 ...

... G10L-0015/28

... G10L-0013/00 ...
... G10L-0015/00

Original Publication Data by Authority

...advance. To this end, the system and the method are subdivided into
several, essentially independently operating segments with unambiguous
interfaces. A first segment serves for the recognition of words and outputs

a...

...sequences, or also single words, are replaced by concepts in the concept
graph. In a further segment the concepts in the concept graph are bridged
and gaps between the concepts are filled. A speech model is taken into
account for this operation. A stochastic attributed grammar is used to
determine the concepts from the word graph and to determine the meaning
of the word sequences in the concepts. Because irrelevant filler words
in the spoken request are ignored early on, the computational
expenditure is comparatively small despite the freely pronounced requests.
Claims:

...A system for outputting speech information in response to input
speech signals, comprising: first means for comparing the speech signals
with stored reference signals which correspond...

...outputting words determined by the comparisons, together with a score
and with statements as regards starting point and end point for each
word output, and for storing the words output with the statements in
order to form a word graph from consecutive words, second means for
replacing predetermined... A system for outputting spoken information in
response to input speech signals, comprising: first means for comparing
the speech signals with stored reference signals which correspond...

...order to form a word graph from consecutive words, second means for

replacing predetermined word sequences in the word graph by

predetermined concepts with a score which is derived from the sum...

...Consortium which is the equivalent of a de facto Web standard. The
standard adds a speech recognition grammar format for words and
phrases spoken in response to prompts. The voice browser uses a speech
server attached...

...provider's facility. Users dial into a speech server that downloads
VoiceXML 2.0 scripts, grammar formats, and audio files from an
application server. The voice browser interprets the VoiceXML 2...

...a voice massage that can be prerecorded or in text sent through a 
text -to- speech synthesizer . The user responds by speaking or by 
touching the buttons of a t oucht one phone, and the public switched 
telephone network transfers the speech and tones from the phone, while the 
speech server processes it. The voice browser gains access to 
applications through the Internet. 
This thesis investigates the implementation and evaluation of commercial

The semantic control is realized by the embedded program in each grammar
rule. The parsing process proceeds in a left to right and bottom up
manner, selecting...

...partial parsed trees. In order to prevent an incorrect rule selection,
NECLI N invokes the embedded program which decides whether the rule
application is allowed or not. The decision is made based on the semantic
ABSTRACT: The writer discusses speech recognition technology. Since
Apple first launched voice control of its operating system in 1993,

in an attempt to match people's speech to words in its dictionary.
Promoters tend to overlook three basic problems that will probably prevent
...Consortium, which is the equivalent of a de facto Web standard. The
standard adds a speech recognition grammar format for words and
phrases spoken in response to prompts. The voice browser uses a speech
server attached...

...provider's facility. Users dial into a speech server that downloads
VoiceXML 2.0 scripts, grammar formats, and audio files from an
application server. The voice browser interprets the VoiceXML 2...

...a voice message that can be prerecorded or in text sent through a 
text-to-speech synthesizer. The user responds by speaking or by
touching the buttons of a touchtone phone, and the public switched
telephone network transfers the speech and tones from the phone, while the 
speech server processes it. The voice browser gains access to 
applications through the Internet. 
... authorized terms is described. This little-touted system, which has
been operating successfully since 1983, matches concepts, rather than
words. Subject Switching uses a translation table, known as the Lexical
Dictionary, accessed by a program that determines which rules to
follow in making the transition from DTIC's to NASA...
... conditions for sentences, yielding eight experimental conditions.
Apraxic speakers exhibited significantly longer vowel and between-word
segment durations than control speakers in all conditions. Apraxic
speakers consistently produced longer vowel and between-word segment
durations in...

... intrasubject and intersubject variability for between-word segment
durations were substantially greater for the apraxic speakers in
sentences compared to word conditions, whereas control speakers
exhibited greater homogeneity in sentence production. The differences in
duration and variability in sentence production versus word or word-string
production imply different mechanisms for executing motor programs for
varying linguistic stimuli.
Abstract: Considerable progress has been made in speech-recognition
technology and nowhere has this progress been more evident than in the area
of large-vocabulary recognition (LVR). Laboratory systems are capable of
transcribing continuous speech from any speaker with average word-error
rates between 5% and 10%. If speaker adaptation is allowed, then after 2 or ...

... for most speakers. LVR systems had been limited to dictation
applications since the systems were speaker dependent and required words
to be spoken with a short pause between them. However, the capability to
recognize natural continuous-speech input from any speaker opens up
many more applications. This article discusses the principles and
architecture of LVR systems and identifies the key issues...
Copyright 1995, IEE

Title: Large vocabulary speech recognition
Abstract: Considerable progress has been made in speech recognition
technology over the last few years and nowhere has this progress been more
evident than...

... Large Vocabulary Recognition (LVR). Current laboratory systems are
capable of transcribing continuous speech from any speaker with average
word error rates of between 5% and 10%. If speaker adaptation is allowed
then after 2...

... for most speakers. Hitherto, LVR systems have been limited to dictation
applications since they were speaker dependent and they required words
to be spoken with a short pause between them. The capability to recognise
natural continuous speech input from any speaker, however, opens up
many more applications and as a result LVR technology appears to be on
the brink of widespread deployment...
Descriptors: speech recognition

Identifiers: large vocabulary speech recognition; ...
...natural continuous speech input
DIALOG(R) File 6: NTIS
(c) 2008 NTIS, Intl Cpyrght All Rights Res. All rts. reserv.

0983555 NTIS Accession Number: AD-A118 444/9/XAB
Voice Verification Upgrade
(Final technical rept. 12 May 80-31 Dec 81)
Davis, R. L.; Sinnamon, J. T.; Cox, D. L.
Texas Instruments, Inc., Dallas.
Corp. Source Codes: 013046000; 347650
Sponsor: Rome Air Development Center, Griffiss AFB, NY.
Report No.: TI-08-82-07; RADC-TR-82-139
Jun 82 246p
Languages: English
Journal Announcement: GRAI8225
Order this product from NTIS by: phone at 1-800-553-NTIS (U.S.
customers); (703)605-6000 (other countries); fax at (703)321-8547; and
email at orders@ntis.fedworld.gov. NTIS is located at 5285 Port Royal Road,
Springfield, VA, 22161, USA.
NTIS Prices: PC A11/MF A01

...two major objectives. The first was to build, test, and deliver to the
government an entry control system using speaker verification (voice
authentication) as the mechanism for verifying the user's claimed identity.
This system..

... scale to prevent more than one user from gaining access with one
verification (tailgating). The speaker verification part of the entry
control system contained all the updates and embellishments to the
algorithm that was developed earlier...

Descriptors: *Speech recognition; *Voice communications;
*Identification systems; *Access; *Computer programs; Algorithms;
Validation; Confidence level
(c) 2008 Elsevier Eng. Info. Inc. All rts. reserv.

11772933 E.I. No: EIP073110714622
Title: Short message dictation on Symbian series 60 mobile phones
Author: Karpov, E.; Kiss, I.; Leppanen, J.; Olsen, J.; Oria, D.; Sivadas,
S.; Tian, J.
Corporate Source: Nokia Research Center Multimedia Technologies
Laboratory, 33720 Tampere, Finland
Conference Title: ICMI'06: 8th International Conference on Multimodal
Interfaces
Conference Location: Banff, AB, Canada Conference Date:
20061102-20061104
E.I. Conference No.: 69919
Source: ICMI'06: 8th International Conference on Multimodal Interfaces,
Conference Proceeding ICMI'06: 8th International Conference on Multimodal
Interfaces, Conference Proceedings 2006.
Publication Year: 2006
ISBN: 9781595935410
DOI: 10.1145/1180995.1181020
Language: English

...Abstract: devices is a challenging task. First, it involves memory
and CPU-efficient implementation of robust speech recognition
algorithms that are generally resource demanding. Secondly, the acoustic
and language models employed in the...

... correction, and must be suitable for a mobile usage scenario. In this
demonstrator, an embedded speech recognition system for short message
(SMS) dictation in US English is presented. The system is running...

...are small, 2 and 2.5 megabytes, respectively. After a short enrollment
session, most native speakers can achieve a word accuracy of over 90%
when dictating short messages in quiet or moderately noisy environments. 2

Descriptors: *Mobile phones; Data storage equipment; Dictating machines;
Error correction; Flash memory; Message passing; Natural language
processing systems; Program processors; Random access storage; Speech
recognition

Identifiers: Short message dictation; Challenging task; Speech
recognition algorithms; Speech language; Native speakers
DIALOG(R) File 34: SciSearch(R) Cited Ref Sci
(c) 2008 The Thomson Corp. All rts. reserv.

16992811 Genuine Article#: 218BJ No. References: 47
Title: Neural characteristics of successful and less successful speech and
word learning in adults
Author(s): Wong PCM (REPRINT); Perrachione TK; Parrish TB
Corporate Source: Northwestern Univ, Dept Commun Sci & Disorders, Inst
Neurosci, 2240 Campus Dr/Evanston//IL/60208 (REPRINT); Northwestern
Univ, Dept Commun Sci & Disorders, Inst Neurosci, Evanston//IL/60208;
Northwestern Univ, Dept Linguist, Evanston//IL/; Northwestern Univ, Cognit
Sci Program, Evanston//IL/; Northwestern Univ, Dept
Radiol, Chicago//IL/60611; Northwestern Univ, Dept Biomed
Engn, Chicago//IL/60611
Journal: HUMAN BRAIN MAPPING, 2007, V28, N10 (OCT), P995-1006
ISSN: 1065-9471 Publication date: 20071000
Publisher: WILEY-LISS, DIV JOHN WILEY & SONS INC, 111 RIVER ST, HOBOKEN, NJ
07030 USA
Language: English Document Type: ARTICLE (ABSTRACT AVAILABLE)

...Abstract: report a study assessing the neural correlates of learning to
use these pitch patterns in words by English-speaking adults who
had no previous exposure to such usage. Before and after training,
subjects discriminated...
they learned while blood oxygenation levels were measured using fMRI.
Subjects who mastered the learning program showed increased
activation in the left posterior superior temporal region after
training, while subjects who plateaued at lower...
DIALOG(R) File 34: SciSearch(R) Cited Ref Sci
(c) 2008 The Thomson Corp. All rts. reserv.

09452328 Genuine Article#: 405GR No. References: 112
Title: Voice access of global information for broad-band wireless:
Technologies of today and challenges of tomorrow
Author(s): Lee LS (REPRINT); Lee Y
Corporate Source: Natl Taiwan Univ, Dept Elect Engn, Taipei 10617//Taiwan/
(REPRINT); Natl Taiwan Univ, Dept Elect Engn, Taipei 10617//Taiwan/; Natl
Taiwan Univ, Inst Commun Engn, Taipei 10617//Taiwan/
Journal: PROCEEDINGS OF THE IEEE, 2001, V89, N1, SI (JAN), P41-57
ISSN: 0018-9219 Publication date: 20010100
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 345 E 47TH ST,
NEW YORK, NY 10017-2394 USA
Language: English Document Type: REVIEW (ABSTRACT AVAILABLE)

...Abstract: approaches for accessing the network will be via human voice,
and the integration of spoken language processing technologies with
broad-band wireless technologies will be a key to the evolution of a...

...paper offers a vision of the above concept. Technical considerations
and some typical example applications of accessing the information
and services using voice in a broad-band wireless environment are
discussed. Fundamentals of spoken language processing technologies
that are crucial in such a broad-band wireless environment are briefly
reviewed. Technical...

...Identifiers--CONTINUOUS-SPEECH RECOGNITION;
MAXIMUM LIKELIHOOD-ESTIMATION; SOFTWARE RADIO ARCHITECTURE; ENHANCED
DATA RATES; SPEAKER ADAPTATION; SPOKEN LANGUAGE; WORD
RECOGNITION; MODEL; EVOLUTION; IMT-2000
DIALOG(R) File 99: Wilson Appl. Sci & Tech Abs
(c) 2008 The HW Wilson Co. All rts. reserv.

1155784 H.W. WILSON RECORD NUMBER: BAST94024366
Voice recognition for Windows
AUGMENTED TITLE: ExecuVoice from Media Vision, VoiceAssist from Creative
Labs, Voice Blaster from Covox, and Rover from Digital Soup
Davids, Noah;
Computer v. 27 (Mar. '94) p. 100-3
DOCUMENT TYPE: Product Evaluation ISSN: 0018-9162

Voice recognition for Windows

ABSTRACT: Four speaker-dependent, isolated word voice-recognition
products designed for Microsoft Windows are reviewed. The retail price of
Media Vision's ExecuVoice...

...was the most useful feature, but holding the microphone was cumbersome.
He was able to start all the applications reviewed with ExecuVoice.
However, he found Voice Blaster and Rover a little cumbersome.

DESCRIPTORS: ...Automatic speech recognition devices;
DIALOG(R) File 144: Pascal
(c) 2008 INIST/CNRS. All rts. reserv.

13671707 PASCAL No.: 98-0379788
Communication system for people with physical disability using voice
recognizer
YASUDA S; OKAMOTO A; HASEGAWA H; MEKADA Y; KASUGA M; KAMATA K
Ricoh Co, Ltd, Yokohama-shi, Japan
Proceedings of the 1997 International Technical Conference on
Circuits/Systems, Computers and Communications, ITC-CSCC (Okinawa, Jpn)
1997-07-14/1997-07-16
Journal: IEICE Transactions on Fundamentals of Electronics,
Communications and Computer Sciences, 1998, v E81-A(6) 1097-1104
Language: English

Communication system for people with physical disability using voice
recognizer

... point of view, we propose a multi-modal communication system that is
composed of a voice recognizer, a pointing device, and a text composer.
This system intend to improve the man-machine interface for people with
physical disability. In this system our voice recognition technology
plays a key role in providing a good interface between disabled people and
the...

... to suffer from muscular dystrophy. And he is able to move only his
fingers and speak command words with the assistance of a respirator.

English Descriptors: Voice recognizer; Multimodal interface;
Application; Voice activated input devices; Handicapped persons;
Electronic mail; Interfaces (computer); Personal computers; Multimedia
systems; Database systems; Speech recognition; Experiments
DIALOG(R) File 483: Newspaper Abs Daily
(c) 2008 ProQuest Info&Learning. All rts. reserv.

05866547 SUPPLIER NUMBER: 48485699
This column was written ViaVoice
Levitus, Bob
Houston Chronicle, p 5
Jan 28, 2000
NEWSPAPER CODE: HC
; Newspaper article
LANGUAGE: English RECORD TYPE: ABSTRACT

...ABSTRACT: testing IBM's new ViaVoice Millennium Edition for Macintosh.
This program is a new continuous voice recognition program that
understands words as you speak them then types them into its built-in
word processor, where they appear on screen...

...the software took only a couple of minutes and a reboot. However,
training ViaVoice to recognize my voice was another story completely.
When you first start the program, it presents you with an "Analyze My
Voice Assistant" that helps the program learn to recognize your speech.
You read a story and the program learns how your voice sounds when you
speak those particular words. Unfortunately, the assistant froze twice
during the training process, forcing a hard restart both times...
File 9: Business & Industry(R) Jul/1994-2008/Apr 24
         (c) 2008 The Gale Group
File 15: ABI/Inform(R) 1971-2008/Apr 28
         (c) 2008 ProQuest Info&Learning
File 16: Gale Group PROMT(R) 1990-2008/Apr 22
         (c) 2008 The Gale Group
File 20: Dialog Global Reporter 1997-2008/Apr 28
         (c) 2008 Dialog
File 47: Gale Group Magazine DB(TM) 1959-2008/Apr 16
         (c) 2008 The Gale Group
File 75: TGG Management Contents(R) 86-2008/Apr W
         (c) 2008 The Gale Group
File 80: TGG Aerospace/Def.Mkts(R) 1982-2008/Apr 22
         (c) 2008 The Gale Group
File 88: Gale Group Business A.R.T.S. 1976-2008/Apr 08
         (c) 2008 The Gale Group
File 112: UBM Industry News 1998-2004/Jan 27
         (c) 2004 United Business Media
File 141: Readers Guide 1983-2008/Mar
         (c) 2008 The HW Wilson Co
File 148: Gale Group Trade & Industry DB 1976-2008/Apr 09
         (c) 2008 The Gale Group
File 160: Gale Group PROMT(R) 1972-1989
         (c) 1999 The Gale Group
File 275: Gale Group Computer DB(TM) 1983-2008/Apr 21
         (c) 2008 The Gale Group
File 587: Jane's Defense&Aerospace 2008/Apr W1
         (c) 2008 Jane's Information Group
File 620: EIU: Viewswire 2008/Apr 25
         (c) 2008 Economist Intelligence Unit
File 610: Business Wire 1999-2008/Apr 28
         (c) 2008 Business Wire.
File 613: PR Newswire 1999-2008/Apr 28
         (c) 2008 PR Newswire Association Inc
File 621: Gale Group New Prod.Annou.(R) 1985-2008/Apr 10
         (c) 2008 The Gale Group
File 624: McGraw-Hill Publications 1985-2008/Apr 28
         (c) 2008 McGraw-Hill Co. Inc
File 635: Business Dateline(R) 1985-2008/Apr 25
         (c) 2008 ProQuest Info&Learning
File 636: Gale Group Newsletter DB(TM) 1987-2008/Apr 23
         (c) 2008 The Gale Group
File 647: CMP Computer Fulltext 1988-2008/Mar W
         (c) 2008 CMP Media, LLC
File 674: Computer News Fulltext 1989-2006/Sep W
         (c) 2006 IDG Communications
File 696: DIALOG Telecom. Newsletters 1995-2008/Apr 25
         (c) 2008 Dialog
File 810: Business Wire 1986-1999/Feb 28
         (c) 1999 Business Wire
File 813: PR Newswire 1987-1999/Apr 30
         (c) 1999 PR Newswire Association Inc

Set     Items       Description
S1      292846      (SPEECH OR LANGUAGE OR VOICE OR AUDITORY)(2N)(RECOGNITION
                    OR RECOGNIZ? OR RECOGNIS? OR PROCESS? OR INPUT? OR OUTPUT? OR
                    SYNTHES?)
S2      435523      GRAMMAR OR VOCABULARY OR DICTIONAR? OR LEXICON?
S3      8496659     ENTRY OR ENTRIES OR WORD? ? OR UTTERANCE? ?
S4      31514281    FILE OR FILES OR FOLDER? ? OR PROGRAM? ? OR PROGRAMME OR
                    APPLICATION? ? OR EXECUTABLE
S5      15995033    USER OR USERS OR CLIENT? ? OR SUBSCRIBER? ?
S6      629913      S5(4N)(SPEAK??? OR ENUNCIAT??? OR VOCALIZ??? OR SAYS OR
                    STAT??? OR UTTER???)
S7      2853418     S4(3N)(OPEN??? OR RUN OR RUNS OR EXECUT??? OR START??? OR
                    LAUNCH??? OR ACCESS??? OR ACTIVAT???)
S8      159972      (VOICE OR LANGUAGE)(2N)(CONTROL? OR ACTIVAT? OR OPERAT? OR
                    COMMAND?)
S9      1372913     OPERATING()SYSTEM
S10     25          AU=(MAJOR, A? OR MAJOR A? OR WANDINGER, M? OR WANDINGER M?)
S11     0           S10 AND S1
S12     0           S10 AND S9
S13     527230      S3(15N)S4
S14     19204       S1(8S)S2
S15     4702        S6(20N)S7
S16     44          S14(8S)S15
S17     23          RD (unique items)
S18     4738        S8(20N)S9
S19     64656       S3(5N)(LINK??? OR MATCH??? OR ASSIGN??? OR DESIGNAT??? OR
                    ALLOCAT???)
S20     9201        S19(20N)S4
S21     114         S14(8S)S18
S22     27          S21(8S)S7
S23     16          RD (unique items)
S24     16          S23 NOT S16
S25     10763       S1(30N)S2
S26     88          S20(8S)S25
S27     32          S26(8S)S7
DIALOG(R) File 15: ABI/Inform(R)
(c) 2008 ProQuest Info&Learning. All rts. reserv.

01414543 00065530
Dragon Systems: Continuous voice-recognition software
O'Heir, Jeff
Computer Reseller News n730 PP: 16 Apr 7, 1997
ISSN: 0893-8377 JRNL CODE: CRN
WORD COUNT: 504

TEXT: DRAGON SYSTEMS today unveiled Naturally Speaking, a continuous
voice-recognition software application geared toward general and vertical
corporate markets. For VARs, the improvements in the technology could help
jump-start speech recognition applications into mainstream businesses.

During a demonstration at the World Trade Center held last week in New
York, Naturally Speaking allowed the user to do just that.

"I ... used ... to ... talk ... like ... this. ... One ... word ... at ... a
... time...

...each word and even allowed Gould to verbally correct or change words, 
select from a dictionary and choose fonts. The software hardly failed at 
picking up the correct words and even... 
...user's voice, officials said.

"From a reseller perspective, this is the Holy Grail of speech
recognition. This is prime time," said Michael Mardini, vice president of
Talk Technology Inc., a New...

...sales to grow beyond vertical markets.
Mardini and Laith Amity, director of Talk Write, a speech-recognition
systems integrator in Oxford, England, said they each sold about 1,000
units of DragonDictate...
DIALOG(R) File 15: ABI/Inform(R)
(c) 2008 ProQuest Info&Learning. All rts. reserv.

00828273 94-77665
Survey of current speech technology
Rudnicky, Alexander I; Hauptmann, Alexander G; Lee, Kai-Fu
Communications of the ACM v37n3 PP: 52-57 Mar 1994
ISSN: 0001-0782 JRNL CODE: ACM
WORD COUNT: 3100

TEXT: Speech recognition and speech synthesis are technologies of
particular interest for their support of direct communication between
humans and computers...

...at which they are highly skilled. Both manipulate speech in terms of its
information content; recognition transforms human speech into text to
be used literally (e.g., for dictation) or interpreted as commands to
control applications, and synthesis allows the generation of spoken
utterances from text.

Automatic speech recognition (ASR) has gained a significant amount of
commercial success due to its demonstrable increase in...

...ASR are diction, personal computer interfaces, automated telephone
services, and special purpose industrial applications.

Large vocabulary dictation: "unrestricted" dictation (e.g., business
letters or newspaper articles) and structured report generation (e...

...with the system for an extended period of time. Discrete-word systems
are prevalent. Current vocabulary capacities range up to 40,000 words.
These systems are meant to be used under...

...system is increased by its ability to adapt to an individual's voice
(speaker adaptation), vocabulary (new word learning), and tasks (language
model adaptation).

For interfacing with personal computers, the "electronic...
...computer interface. Some of the nearer-term opportunities include:

* Speech as a shortcut. Rather than opening a file by traversing many
levels of hierarchy, a user just says "OPEN BUDGET."

* Hands busy/eyes busy. Change the font style while a user is typing...
...use or even nonexistent, thereby making speech a competitive
alternative.

The main challenge for desktop speech recognition is the current
existence of a mature and efficient alternative--the keyboard (and
pointer). It...

...management, appropriate user feedback, as well as integration into the
computing environment.

In the future, speech recognition might be combined with natural
language processing in more ambitious tasks, such as information
retrieval or delegation interfaces. The proper use of...
... 20 words. Significant functionality does not require large vocabularies;
some currently deployed systems provide a vocabulary size of two words
("yes" and "no"). In addition to poor control over signal quality...

...prompts (known as "barge in"), or they will embed their responses in
other, out-of-vocabulary speech ("yes, please"). Word-spotting techniques
can be used to achieve acceptable recognition accuracies [15...
...and are amenable to a number of constraint-based techniques that
increase the success of voice input. Speaker-dependent recognition is
sufficient for many applications, since a particular device will be used...
DIALOG(R) File 15: ABI/Inform(R)
(c) 2008 ProQuest Info&Learning. All rts. reserv.

00622631 92-37733
Voice Processing Systems
Helms, Glenn L.
Internal Auditor v49n3 PP: 62-68 Jun 1992
ISSN: 0020-5745 JRNL CODE: IAU
WORD COUNT: 3062

...TEXT: keying in a password. Any individual who has knowledge of the
password can gain unauthorized access to files. In order to access a
voice system the user is required to speak a password. A voiceprint is
created from the spoken password and compared with a stored...

...and retail stores, have routine customer inquiries answered by a voice
system. Many banks have voice output systems which permit a customer to
inquire as to the balance in a checking account...

...amounts, payment due dates, and amounts and types of coverage. Some
major retail chains utilize voice output to contact customers when
mail-order items have arrived and can be picked up at...

...executing other transactions at anytime, rather than just during
regular business hours.

* CONTROLS OVER VOICE INPUT

Speech recognition systems are not immune to problems. For example,
certain user speech characteristics might affect the...

...user option might be to quickly retrain a speaker-dependent system that
has a small vocabulary.

Speaker-independent systems that are available to the general public
probably should have a limited vocabulary. Inadequate grammar skills
and colloquialisms can create problems, such as when a user states:
"Transfer four hundred...

...as "Transfer four hundred dollars and twenty five cents from savings to
checking." If a speech recognition problem occurs in a
speaker-independent system available for the general public, processing can
be...

...noisy office or factory environment can have an adverse effect on the
reliability of some speech recognition systems. In such cases,
developers should consider acquiring directional microphones which filter
out irrelevant background sounds.

Speech recognition systems also have behavioral implications. Users who
train a speaker-independent system typically talk in a more formal voice
when they construct a vocabulary. When the system becomes operational and
the user speaks in his/her natural voice with...

...technology suggest that users may require an inordinate amount of time
to create a large vocabulary in a speaker-dependent system and that the
trainer might forget which words are contained in a large vocabulary
thereby creating frustrating situation. These critics call for a standard
vocabulary to be used across systems. For example, "rub out" consistently
means "delete" whenever a user...
DIALOG(R) File 16: Gale Group PROMT(R)
(c) 2008 The Gale Group. All rts. reserv.

06151018 Supplier Number: 53939885 (USE FORMAT 7 FOR FULLTEXT)

Speech recognition software and systems company Nuance Communications
Inc introduced two new products Monday as part of...

...or voice-activated commerce) Alliance - has announced Nuance 6.2, an
upgrade to its natural language speech recognition server, as well as
Nuance Express, which is billed as a low-cost offering built...

...product also supports Nuance SpeechObjects(TM), a set of reusable
components for rapid development of speech recognition applications.
Other enhancements include single phrase correction and the ability to
personalize applications on the...

...entry-level applications. It is targeted at less complex, menu-driven,
systems that require a vocabulary of less than twenty items and is said
to allow programmers with no prior speech...

... a cost-effective upgrade path for a customer looking to start with an
entry-level speech recognition system built on Nuance Express and then
gradually migrate to a more sophisticated, larger vocabulary system based
on Nuance 6.2. Developers only need to learn one API set and...

...more complex application components to an entry-level system as
required. Installation of a new speech recognition server is not
needed, the company says, and a credit towards complete Nuance 6.2...

...start at $3,000, and are priced according to the complexity and capacity
of the application. Nuance Express, meanwhile, starts at $250 per
concurrent user. The company says both products are available for
Windows NT, Sun Solaris, IBM AIX, SCO Unix and Digital...
Five years ago, speech recognition required special hardware, a
lot of money, and a person willing to...talk...like...this, using a very
limited vocabulary. Not surprisingly, it wasn't a big seller. During the
first half of this year, however, unit sales of speech-recognition
products increased more than 100 percent, with revenue growing 200 percent
over last year's.

Speech-recognition software is clearly on a fast track to
becoming a legitimate consumer item. It is making inroads everywhere. Word
processing, natural language search programs, form entry, transcription,
medical and legal record systems, corporate applications, e-mail and
Internet chat, and automated telephone services are all being targeted by
new products.

"Speech recognition has a 'gee-whiz' factor," says Roger Matus,
vice president of marketing at Dragon Systems...

...something new."

IBM's ViaVoice 98 and Dragon Systems' NaturallySpeaking 3.0 are the
leading speech-recognition programs on the market. Each company has
products for less than $60. Both use continuous rather than isolated speech
and can handle an extensive vocabulary, and both offer additional
software that handles technical terms specific to certain occupations.

As the speech-recognition software market gets crowded with
companies and products, and as the technology improves, IBM and...

...at IBM sees improved accuracy, increased ease of use, and lower prices
as having brought speech-recognition software to this point.

"This is a great, exciting technology," she says. "It's useful...

...into Standard, Preferred, and Professional versions. Both companies have
also released legal- and medical-specific vocabulary modules.

ViaVoice 98 Office Edition offers specific business and finance
topics to add to the base vocabulary, and the Executive version adds
another topic, computing. For the Home product, IBM included a cuisine
topic.

"The topics differentiate the products," London says. "For novice
users of Home, where teenagers would be working on book reports, ViaVoice
is a simple dictation application. Office and Executive offer more
involved navigation tools." Dragon offers business and computer vocabulary
modules separately.

Dragon Systems' big new release for the holiday season is
NaturallySpeaking for Teens, a speech-recognition program aimed
directly at young adults. Dragon Systems went out to school districts and
set...
leveraging their existing skills to bring their customized content
from the Web to the mobile user.

A voice-activated flight status service is one example of a VoxML
application that provides convenient access to timely information. A
user calling in to access airline information simply says, "Is the...

...may interface with encapsulated dialogue components such as
SpeechObjects from Nuance Communications, the leader in speech
recognition for self-service transactions. SpeechObjects are reusable
speech components that incorporate the potentially complex dialogue and
vocabulary that may be used in speech applications.

"VoxML adds enormous potential to the development of...
ABSTRACT:

...IBM Corp. is a less-expensive product than the company's VoiceType
Dictation 3.0 speech recognition, which is more powerful. Simply
Speaking will be favored by nontypists over one-finger keyboard...

...and VoicePad, which is a modified version of Microsoft Windows' WordPad,
that translates what the user says into text. Unlike VoiceType
Dictation, Simply Speaking does not allow you to start applications by
voice or to dictate sentences into a word processor. The VoicePad must be
utilized...

...to translate voice commands into the text. IBM says that Simply Speaking
has a 90% speech recognition capability. Although conjunctions,
prepositions, nouns, and verbs are handled well by the 20,000-word
dictionary, the package has difficulty recognizing other tenses. The
package, priced at $50, has potential but...
using the software immediately without training.
"All the commands can be spoken the moment you launch the voice
recognition application," says Robert Hayes, President of Cylogic.

Hayes also says that users with deep, or heavy foreign accents
can instantly 'Quick Train' the software (52 words) with...
Spell.

Modern speech technology can be broken down into three categories:
products that let computers synthesize speech from digital input;
those that let them recognize (or enter) speech from outside sources;
and those that combine and build on the first two categories to...

...available for the enabling technologies market segment, whose users are
often on a tight budget.

Speech synthesis technology produces spoken output either from
text appearing on a computer screen or in response...

...spreadsheets by having the computer read aloud ranges of numbers.
Visually impaired users can use speech synthesis technology to access
data and other information displayed on the computer screen. Most speech
synthesis systems also digitally record, store, edit and play back speech
- useful for sound effects and sending verbal electronic mail. Many systems
also offer limited speech recognition that lets users verbally enter
some software commands.

VARs report surprisingly good results working with even the low-end
speech synthesis products. 'Until I beta tested the Windows Sound
System I thought it was a toy...

...a long download session, that you might otherwise miss.'

Elton B. Sherwin Jr., manager of speech recognition strategy at
IBM, says even VARs that are not primarily interested in reselling speech
systems...

...the competition at a relatively low cost,' he says.

In contrast to the low-end speech synthesis products, the high-end
products, such as Digital Equipment Corp.'s DECtalk PC Speech Synthesizer and
Emerson & Stern Associates Inc.'s Sound Bytes, are likely to be used in
settings where speech is the primary input mechanism. These products offer
faster speaking rates, higher speech quality, user-definable
dictionaries and more sophisticated pronunciation, punctuation and
grammar options, among the other features. They are typically used with
screen access programs capable of feeding them information from the
computer screen. Examples include TeleSensory's ScreenPower, Henter-Joyce
Inc.'s JAWS, and, for Macintosh, Berkeley Systems Inc.'s outSPOKEN.

Speech recognition isn't a new idea, but it has gone beyond the
just-talk stage only... but it turned out to be a difficult problem. Speech
is tremendously complex.'

For decades, speech recognition was restricted mainly to high-end
vertical markets with limited vocabulary, high-throughput data entry
needs - for example, insurance companies and medical recording firms. In
the...

...and efficient, and processing power dropped in price, changing the
market.

Oberteuffer predicts that once speech recognition crosses an
accuracy/robustness/price threshold, there will be widespread, immediate
acceptance. He even suggests...
...until a combination of circumstances caused their prices to drop during
the 1980s.)

Still, current speech recognition systems, although vastly better
than those available even a few years ago, fall far short...
...and performance. A comprehensive 1993 report by San Francisco market
analysis firm Frost & Sullivan states: 'Voice recognition is the least
developed of voice technologies and is widely considered to be a technology

...its infancy. Industry experts agree it will be at least a decade before
a mature voice recognition product is introduced in the market.'

Most midrange speech recognition systems are speaker-dependent;
they require 'training' - via the tedious repetition of key words and...

...intonation and vocal patterns of particular speakers. Many also fall
short of providing truly continuous speech recognition because users
must insert a brief pause between words. They also tend to be overly...

...the company offers. IBM, Kurzweil Applied Intelligence Inc. (Waltham,
Mass.), Dragon Systems Inc. (Newton, Mass.), Voice Processing Corp.
(Cambridge, Mass.), Dialogic (Parsippany, N.J.), Rhetorex (Campbell,
Calif.) and Lernout & Hauspie (Brussels, Belgium) are just a few of the
companies that license or resell speech-recognition algorithms, boards
and other components to resellers.

One of the more appealing aspects of speech recognition involves
Interactive Voice Response (IVR) technology. Ideally, an IVR system would
blend both speech synthesis and speech recognition to create an
easy-to-use automated response system - say, a fax-on-demand, information

...without further human intervention.

In practice, however, there are limitations not just in present-day
speech recognition technology, but those inherent in the current
telephone infrastructure. Telephones distort speech and introduce noise,
making over-the-phone speech recognition even more difficult than the
ordinary kind. Therefore, most current IVR systems rely on touch-tone, not
spoken, user input. (Over-the-telephone speech recognition is a
critical issue in Europe and much of the rest of the world, since...
of the expanded memory boards is dramatically less than that of
previously available, comparable large vocabulary systems and
boards."

The expanded vocabulary options increase support for multiple
speakers and multiple applications. Verbex's "speaker flexible"
software allows any authorized member of a department to speak into
any conversational voice input microphone unit. Each operator can use
a number of different microphones at different locations in the work
area, and can access more than one application from any microphone.

Any department user can independently activate any station and
instantly access his/her voice file. After a brief initial training
session, the speakers' voice patterns can be permanently stored on.
ABSTRACT:

...use a number of different microphones at different locations in the work
area, and can access more than one application from any microphone. Any
department user can independently activate any station and instantly
access his/her voice file. The flexibility of the VoiceStation offers
the advantages of "speaker independence," plus high speed, accuracy, and a
virtually unlimited vocabulary. The GUI C Voice Quality Inspection and
Control System recognizes trillions of sentences, commands and unlimited...

...Verbex system is language independent and understands regional accents
and foreign languages. It also accurately recognizes variations in
speech during high throughput, high noise or other stress-induced
conditions.
and are then invited to practise the chosen language through a
series of comprehension questions, vocabulary and grammar exercises.
Each lesson is supplemented by orthographical and grammatical explanations.
The first English lesson is...
self-service transactions. SpeechObjects are reusable speech
components that incorporate the potentially complex dialogue and
vocabulary that may be used in speech applications. "VoxML adds enormous
potential to the development of...
including Kurzweil Applied Intelligence Inc., Dragon Systems Inc.,
Speech Systems Inc., Philips Dictation Systems Inc., Voice Processing
Corp., and Timeworks Inc. will target this arena with new
voice-recognition products.

But the largest obstacle to adoption of voice recognition is
integrating it into installed applications. To smoothly use voice
recognition in an application requires a user to speak naturally; for
example, saying "Open new file and name it xyz," rather than having to
incorporate a one-second delay between each...

...goal, Kurzweil is preparing to announce an alliance with vendors
including WordPerfect Corp. to integrate speech recognition into
applications. This will enable users to use voice more smoothly, said
officials of the...

...Orem, Utah.

At Comdex, Speech Systems will debut Phonetic Engine 500, a
speaker-independent, continuous speech-recognition software package
with a large vocabulary. The $595 product uses an add-in ISA/EISA card
with an on-board digital...

...Phonetic Engine 500 System Development Kit costs $995.

Philips will debut Speech Magic, its continuous speech-recognition
software due this fall. Voice Processing will debut a version of its
voice-recognition software for command and control of applications.

Kurzweil will also demonstrate the Windows version of...

...Newton, Mass., company.

Finally, Timeworks will introduce at the show its Timeworks Say It
Command Voice Recognition System which will be priced at less than
$300 when it is released in June...
of next year. "The market has taken us by surprise," admits Elton
Sherwin, manager of speech-recognition strategy for IBM Power Personal
Systems. "We're a year ahead of where we expected...

...further up the food chain. His program Voice User, sits atop Voice
Pilot, the Microsoft program that enables voice activation of programs
within the Windows Sound System. Voice User, he says, extends its
rudimentary vocabulary and command set and improves the interface. Hayes,
of course, hopes that OEMs will bundle...
automatically.

What hubris. By the mid-1960s the scientists realized that linguistic
talent goes beyond lexicons and parsing. Typical of the output of a
computer translator: "Out of sight, out of...

...the U.S. subsidiary of Bravice International, a Japanese company, closed
down last September. Automated Language Processing Systems in Salt Lake
City, which does computer-assisted translation, is the only firm in...

...The problem is that language translation is far subtler than simply
substituting words from a dictionary. To translate, a machine must
understand; to understand, it must have not just a dictionary and rules
of syntax, but also common sense. "The green house on the hill" would...

...a month to lease software for one language pair, such as English-French.
The enormous programs run on IBM mainframes and Wang minicomputers,
among other systems. Logos says that a user whose translation needs
exceed 3,000 pages a year would save by using a computer...
onto accounting professionals' desks.
WHAT IS VOICE RECOGNITION SOFTWARE?

Simply, voice recognition software allows computer users to speak
commands into their computers--such as "open file"--via a microphone to
complete the task.

Voice recognition systems range from high-end applications...

...on functionality.

Microsoft Office XP,
office.microsoft.com/en-us/assistance/HA010565111033.aspx, possesses basic
speech recognition capabilities, which can be an inexpensive way to
experiment with the technology.

Want greater capabilities...
Automatic speech recognition (ASR) has gained a significant
amount of commercial success due to its demonstrable increase in...

...ASR are diction, personal computer interfaces, automated telephone
services, and special purpose industrial applications.

Large vocabulary dictation: "unrestricted" dictation (e.g.,
business letters or newspaper articles) and structured report generation (e

...with the system for an extended period of time. Discrete-word systems
are prevalent. Current vocabulary capacities range up to 40,000 words.
These systems are meant to be used under...

...system is increased by its ability to adapt to an individual's voice
(speaker adaptation), vocabulary (new word learning), and tasks (language
model adaptation).

For interfacing with personal computers, the "electronic...

...computer interface. Some of the nearer-term opportunities include:

* Speech as a shortcut. Rather than opening a file by traversing
many levels of hierarchy, a user just says "OPEN BUDGET."

* Hands busy/eyes busy. Change the font style while a user is typing

...use or even nonexistent, thereby making speech a competitive
alternative.

The main challenge for desktop speech recognition is the current
existence of a mature and efficient alternative--the keyboard (and
pointer). It...

...whether the user's intent was identified and the correct action taken). 

The availability of speech recognition technology has encouraged
the development of prototypes that merge speech capabilities into a
general-purpose computer interface [12, 13]. Research on the usefulness of
speech recognition for various activities is also under way. For the
most part, work in this area...
...and has not been able to explore the capabilities of more advanced
recognition technology [9].
Synthesis

Speech synthesis offers an output channel in cases where visual
displays are either not possible, insufficient, or awkward (such as over
the telephone or while driving). The goal for most speech synthesis
systems is to produce speech of useful intelligibility; the ultimate goal
is to produce speech.

...of complete sentences is again over five times as large as that of
natural human speech (4.7% synthesized vs. 0.8% for natural speech)
[11]. Furthermore, these intelligibility studies were conducted in a...
each word and even allowed Gould to verbally correct or change
words, select from a dictionary and choose fonts. The software hardly
failed at picking up the correct words and even...

...user's voice, officials said.

"From a reseller perspective, this is the Holy Grail of speech
recognition. This is prime time," said Michael Mardini, vice president of
Talk Technology Inc., a New York-based reseller of integrated
speech-recognition systems.

Resellers expect product sales to grow beyond vertical markets.

Mardini and Laith Amiry, director of Talk Wit e, a
speech-recognition systems integrator in Oxford, England, said they each sold
about 1,000 units of Dragon...
great deal of investigative effort.
Technically, true continuous recognition allows for both free-form
spoken input and natural speech rhythms. Continuous availability means
that a speech-recognition system is constantly "listening" to the user
and is always able to assign the proper...

...short term

Continuous recognition requires a vast amount of statistical data on
the speaker's vocabulary and idiosyncratic habits of expression.
Continuous availability suffers from a similar defect in that it...
...exacting, ponderous solutions.

Recall that Voice Tools' controls rely on the IBM VoiceType engine for
speech recognition. This wouldn't be good news if it also meant that
the controls demanded discrete-speech input. However, most of Voice
Tools' controls take advantage of the circumstances relevant to using
spoken.

...to prevent your application from behaving erratically and to ensure that
it doesn't receive speech input at inappropriate times, even if the
user forgets to issue the commands that turn off the microphone.
Getting Started

Speech processing is a relatively complex and processor-intensive
task; therefore, it's important to start designing...

...will need a 90MHz Pentium processor or faster, 50MB of hard drive space
for the speech-recognition package, at least 16MB of RAM, a microphone,
speakers, and a Sound Blaster 16 (or...

...that contains the Sonar Tool, a nonvisible form whose role is to
orchestrate and control voice processing, so it's typical to set the
Start Visible property to False in the Load...

...Start Form. You'll set up additional visible forms to construct an
application that includes voice-recognition controls to handle jobs
such as managing an application or filling in forms.
The programming...

...visual components is simple and straightforward, mostly amounting to the
use of a handful of speech-processing properties to define the behavior
of the controls. The Say Play and Say Record properties specify the command
word a user must speak to start a sound file's playback and
recording respectively, and Play Interruptable determines whether a sound
file may be interrupted... checking the status of various elements of the
application, such as the microphone and the vocabulary database.
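
The property-driven design described in this excerpt can be illustrated with a
minimal Python sketch. Everything in it is hypothetical: the class
SoundFileControl, its fields say_play, say_record and play_interruptable, and
the handle() method are stand-ins invented for illustration. They are not the
Voice Tools API, which the excerpt does not show.

```python
from dataclasses import dataclass, field


@dataclass
class SoundFileControl:
    """Hypothetical stand-in for a voice-enabled sound-file control."""
    say_play: str = "play"            # command word that starts playback
    say_record: str = "record"        # command word that starts recording
    play_interruptable: bool = True   # may playback be interrupted?
    state: str = "idle"
    log: list = field(default_factory=list)

    def handle(self, spoken_word: str) -> str:
        """Map one recognized word to an action and return the new state."""
        word = spoken_word.strip().lower()
        if self.state == "playing" and not self.play_interruptable:
            # Playback cannot be interrupted, so ignore every command.
            self.log.append(f"ignored:{word}")
            return self.state
        if word == self.say_play.lower():
            self.state = "playing"
        elif word == self.say_record.lower():
            self.state = "recording"
        else:
            self.log.append(f"unknown:{word}")
        return self.state


control = SoundFileControl(say_play="listen", play_interruptable=False)
print(control.handle("Listen"))   # playing
print(control.handle("record"))   # playing (command ignored)
```

Keeping the command words as plain data, rather than hard-coding them in the
handler, is what lets a developer retarget a control by changing a property,
which is the point the excerpt makes.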

Perhaps of the most immediate interest is the Voice Terminal's What
Can I...

...the recognition engine new words. Out of the box, the IBM recognition
engine has a vocabulary of 20,000 words. Users can add up to 2,000 words
using the Voice...
word depends on its context rather than its sound.
It is for these reasons that speech-recognition technology, one of
the oldest and most intriguing fields of computer research, has been slow

...of human cognition, imagine how difficult it is for engineers to
translate or mimic the processes of speech and hearing with computer
technology. It is by no means impossible, however -- some major progress has
been made in the area of speech recognition. Here we look at the
technology behind such products and review an exceptional new
speech-recognition product from Dragon Systems.
The Spoken Word

In other computing applications, a problem's complexity...

...capacity of the hardware and software tools created to solve them. The
challenges linked to speech recognition, however, have remained
constant; after all, the fundamental aspects of speech haven't changed over
the past 20,000 years or so. Today's designers of speech-recognition
applications struggle with the same basic obstacles faced by the pioneering
researchers. The applications they...

...Finally, they must match the word constructions to a list of known words
in their vocabulary to produce text or detect meaning.

If you've followed this technology's development, you...

...commercially available products and see how trade-offs among them
influence quality and usability.

Since speech recognition is a fairly esoteric technology, it is
helpful first to discuss differences among available products...

...or precede them. In fact, recent research that studied the performance
of a high-quality speech recognition engine showed that its most
frequent processing errors were associated with the most common
monosyllabic words; this pattern held across six major languages. By
contrast, continuous-speech recognition systems allow the speaker to
use a more natural cadence. It's a great advantage...

...dictation on a typical desktop system.

A speaker-dependent recognition system must be "trained" to recognize
the voice and speech habits of individual users. Generally, this
involves reading the system a standard text...

...within a few weeks. Such systems have the advantage of allowing users to
increase their vocabulary dynamically; their disadvantage is that they
cannot be readily shared among many users. By contrast,
speaker-independent systems require no training and can be used in
public-access applications.

Another distinguishing factor is the size of a recognizer's active
vocabulary. While a system may be capable of recognizing a large variety
of words, the key consideration is its "active" vocabulary -- how many
different words it can distinguish at any given time. Some systems can
distinguish...

...identify words based on context.
Fragments and Phrases 

One of the most exciting developments in speech recognition is the
advent of reliable "software-only" solutions, which take advantage of a
computer's... speaker-dependent engine such as Dragon's, the first step is
the construction of a vocabulary. The built-in Dragon vocabulary
consists of about 30,000 words (in the classic edition), which are modeled
using tools...

...speakers to record various texts.

Once they have accumulated a large database of waveform samples,
speech signal-processing experts look at frames of speech data and hand
label the groups of frames that...

...to frames and phonemes, more and more of the recognition is handled
automatically by the vocabulary-generation tools. Human intervention is
required only to correct errors in a fairly small number...
...in the 25,000 words of the Dragon Dictate's recognition engine.
(Interestingly, increasing the vocabulary's size to 64,000 words only
increases the total count of PICs to 32,000, and decreasing the vocabulary
to 8,000 words reduces the number of PICs to only 1,000.) With this...

...sound to construct its Hidden Markov Model. The Markov Model, a
foundation technology for most speech-recognition engines, produces a
compact representation of a "piece-wise" constant approximation to the
spectrogram.
Dragon...
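
To make the phrase "piece-wise constant approximation" concrete, here is a
minimal Python sketch. It is not Dragon's algorithm and it is not a hidden
Markov model: it assumes the segment boundaries are already known (a real
recognizer would have to infer them) and simply replaces each segment of a
one-dimensional sequence of frame values with that segment's mean. The
function name piecewise_constant and the sample values are illustrative
assumptions.

```python
from statistics import mean


def piecewise_constant(frames, boundaries):
    """Replace each segment of `frames` with the segment mean.

    `boundaries` lists the start index of every segment after the first,
    in increasing order; e.g. [3, 5] splits into [0:3], [3:5], [5:].
    """
    edges = [0, *boundaries, len(frames)]
    approx = []
    for start, end in zip(edges, edges[1:]):
        if end <= start:
            raise ValueError("boundaries must be strictly increasing and in range")
        level = mean(frames[start:end])      # one constant per segment
        approx.extend([level] * (end - start))
    return approx


# Hypothetical per-frame energy values with three visibly distinct regions.
energies = [1.0, 1.2, 0.8, 5.0, 5.2, 2.0, 2.0, 2.0]
print(piecewise_constant(energies, [3, 5]))
```

The compactness comes from storing one level per segment instead of one value
per frame: eight frames above reduce to three levels plus two boundaries.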



17/3,K/22 (Item 1 from file: 636)

DIALOG(R) File 636: Gale Group Newsletter DB(TM)
(c) 2008 The Gale Group. All rts. reserv.

03992355 Supplier Number: 53103795 (USE FORMAT 7 FOR FULLTEXT)
PROGRAMMERS LEARN TO TALK AGAIN WITH MOTOROLA'S VOXML LANGUAGE.

Voice Technology & Services News, v17, n21, pNA
Oct 13, 1998

Language: English Record Type: Fulltext
Document Type: Magazine/Journal; Trade
Word Count: 900

in a matter of days, leveraging existing skills to deliver
customized Web content to mobile users.

A voice-activated flight status service is one example of a VoxML
application that provides convenient access to information. A user
wanting airline information says, "Is the ABC Airline's flight from
Washington, D.C. on time?" The airline's...

...Park, Calif.-based Nuance Communications. SpeechObjects are reusable
speech components that incorporate complex dialogue and vocabulary used
in speech applications.

"VoxML is independent of any speech recognition engine," Patel
says. "Speech recognition has come a long way in recent months. However
VoxML is an application representation language...
...engines.

"Our implementation happens to work on what we believe is an industry
leader in speech recognition, but different people will have different
implementations of the voice browser on different core technologies...
Spell.

Modern speech technology can be broken down into three categories:
products that let computers synthesize speech from digital input;
those that let them recognize (or enter) speech from outside sources;
and those that combine and build on the first two categories to...
...available for the enabling technologies market segment, whose users are
often on a tight budget.

Speech synthesis technology produces spoken output either from
text appearing on a computer screen or in response...
...spreadsheets by having the computer read aloud ranges of numbers.
Visually impaired users can use speech synthesis technology to access
data and other information displayed on the computer screen. Most speech
synthesis systems also digitally record, store, edit and play back
speech - useful for sound effects and sending verbal electronic mail. Many
systems also offer limited speech recognition that lets users verbally
enter some software commands.

VARs report surprisingly good results working with even the low-end
speech synthesis products. "Until I beta tested the Windows Sound
System I thought it was a toy...

...a long download session, that you might otherwise miss."

Elton B. Sherwin Jr., manager of speech recognition strategy at
IBM, says even VARs that are not primarily interested in reselling speech
systems.

...the competition at a relatively low cost," he says.

In contrast to the low-end speech synthesis products, the
high-end products, such as Digital Equipment Corp.'s DECtalk PC Speech
Synthesizer and Emerson & Stern Associates Inc.'s Sound Bytes, are likely
to be used in settings where speech is the primary input mechanism. These
products offer faster speaking rates, higher speech quality,
user-definable dictionaries and more sophisticated pronunciation,
punctuation and grammar options, among other features. They are
typically used with screen access programs capable of feeding them
information from the computer screen. Examples include TeleSensory's
ScreenPower, Henter-Joyce Inc.'s JAWS, and, for Macintosh, Berkeley
Systems Inc.'s outSPOKEN.

Speech recognition isn't a new idea, but it has gone beyond the
just-talk stage only...

...but it turned out to be a difficult problem. Speech is tremendously
complex."

For decades, speech recognition was restricted mainly to high-end
vertical markets with limited vocabulary, high-throughput data entry
needs - for example, insurance companies and medical recording firms. In the

...and efficient, and processing power dropped in price, changing the
market.

Oberteuffer predicts that once speech recognition crosses an
accuracy/robustness/price threshold, there will be widespread, immediate
acceptance. He even suggests... until a combination of circumstances caused
their prices to drop during the 1980s.)

Still, current speech recognition systems, although vastly better
than those available even a few years ago, fall far short...

...and performance. A comprehensive 1993 report by San Francisco market
analysis firm Frost & Sullivan states: "Voice recognition is the least
developed of voice technologies and is widely considered to be a
technology...

...its infancy. Industry experts agree it will be at least a decade before
a mature voice recognition product is introduced in the market."

Most midrange speech recognition systems are speaker-dependent;
they require "training" via the tedious repetition of key words and...

...intonation and vocal patterns of particular speakers. Many also fall
short of providing truly continuous speech recognition because users
must insert a brief pause between words. They also tend to be overly...
...the company offers. IBM, Kurzweil Applied Intelligence Inc. (Waltham,
Mass.), Dragon Systems Inc. (Newton, Mass.), Voice Processing Corp.
(Cambridge, Mass.), Dialogic (Parsippany, N.J.), Rhetorex (Campbell,
Calif.) and Lernout & Hauspie (Brussels, Belgium) are just a few of the
companies that license or resell speech-recognition algorithms, boards
and other components to resellers.

One of the more appealing aspects of speech recognition involves
Interactive Voice Response (IVR) technology. Ideally, an IVR system
would blend both speech synthesis and speech recognition to create
an easy-to-use automated response system - say, a fax-on-demand,
information...

...without further human intervention.

In practice, however, there are limitations not just in present-day
speech recognition technology, but those inherent in the current
telephone infrastructure. Telephones distort speech and introduce noise,
making over-the-phone speech recognition even more difficult than the
ordinary kind. Therefore, most current IVR systems rely on touch-tone, not
spoken, user input. (Over-the-telephone speech recognition is a
critical issue in Europe and much of the rest of the world, since...
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...TEXT: speech- cent r i c interface for wearable corrputers would have the 
foil owi ng el errent s: 

A standard set of operating system voice commands for all
applications which control actions that are common to all wearable devices
and speech...

...date, and so on.

A standard set of application voice commands that deal with common
application activities, such as accessing files, scrolling text and
asking for help.

The body of the application does not have to...
...recognition accuracy

In general, the out-of-the-box accuracy of state-of-the-art speech
recognition engines is not adequate for all types of applications and
environments. Out-of-the-box...

...be accomplished with dedicated engines. For success, it is most often
necessary to adapt the speech recognition engine to both the
environment, including the background noise and microphone characteristics,
and the application.

Speech data collection. To adapt speaker-independent speech recognition
systems, speech data must be tailored to the application and the
environment. Speech data means digital recordings...

...TEXT: computer skills (such as agency owners and producers).

The three key players in the PC-based voice-recognition market, Dragon
Systems, Kurzweil Applied Intelligence, and IBM, all have roughly equivalent
capabilities, but with inevitable...

...that you designate. The Kurzweil system allows you to enable virtually
any Windows(TM) software to recognize your voice commands.

Star Trek gives us a picture of the ultimate goal of speech recognition,
natural conversation with a computer. Though, not close to the Star Trek
level yet, the latest speech-recognition programs are closer than ever
to reaching the ultimate goal of speaker-independent, continuous dictation.
The newest programs allow you to speak naturally when giving your PC
commands such as "Open file." But the current crop of PC-based systems
still requires discrete speech, where users must...

...and intuitive. The system worked seamlessly with our existing sound card
without any problems. To start using the program we had to complete a
simple setup routine that configured the sound level on our...

...memos, letters, reports and other documents through dictation. We were
impressed with its ability to recognize our voice. The system allows
you to format text, navigate through the application's menus and dialogs...

...settings, and preview and print documents using intuitive voice
commands.

Kurzweil VoicePad has an active vocabulary of 11,500 commonly spoken
words and the ability to add 500 words that are...

...this Evaluation Release.

This evaluation program is a great way for you to see if voice
recognition will work for your office. The more advanced (and more
expensive) Voice Professional v2.5...

...voice response applications. The three major reasons why agencies and
companies have yet to embrace voice recognition: quality, price, and
the degree of integration between the voice software and operating
systems and applications are rapidly being satisfied.

System requirements

Cost also is no longer as large...

launched ScanSoft(R) Dragon NaturallySpeaking(R) XP Edition, the
latest release of its award-winning speech recognition software for
Europe. The new Dragon NaturallySpeaking software is available for home,
small business, corporate...

...were introduced earlier this year. 

ScanSoft Dragon NaturallySpeaking is the market-leading automatic
dictation and speech recognition product for Microsoft(R) Windows(R)
and Windows XP. The highly accurate speech product allows...

...to 160 words-per-minute, saving time and increasing worker productivity.
Dragon NaturallySpeaking also enables voice-control of Microsoft
Windows operating system commands and third-party PC applications,
giving users an efficient way to start programs, save files, surf the
Web and create email.

"As a lawyer, I needed a dictation product that...

...create documents and reports." 

Dragon NaturallySpeaking achieves its accuracy in part through a
built-in vocabulary of more than 250,000 words, and through its support
for adding custom vocabularies. The...

...meets their specific needs. These include:

-- The industry's highest accuracy rate for desktop automatic speech 
recognition 

-- Converting voice into text at up to 160 words-per-minute
-- Hands-free, voice-enabled control of...

noise. Andrea Anti-Noise(R) microphone technology enhances all
voice-driven computing applications, such as speech recognition,
telephony, speech over the Internet, videoconferencing, multi-party
conferencing, multimedia and interactive games.
Merlin integrates IBM's...

...Internet access. Additionally, Merlin will incorporate Java into the
operating system allowing it to natively run Java applications and
Internet applets independent of a Web browser. It will be the first major
Intel-based operating system with built-in speech recognition and will
take the concept of ease-of-use to an entirely new level.

"With Merlin, IBM is leading the way to the next generation of
easy-to-use operating systems," said John N. Andrea, co-president of
Andrea Electronics Corporation. "Computing has clearly achieved significant
advances in voice-interactive operating and Andrea Electronics'
noise-canceling products enhance all of these applications."
Merlin continues IBM's...

...inserting pauses between words, in order to navigate through the desktop
and menus. The navigation dictionary in Merlin will include common
desktop navigation words. Dictation will come with a base vocabulary of
22,000 words and will allow the addition of up to 42,000 more...



24/3,K/5 (Item 1 from file: 20)

DIALOG(R) File 20: Dialog Global Reporter
(c) 2008 Dialog. All rts. reserv.

17222761 (USE FORMAT 7 OR 9 FOR FULLTEXT)
Speaking to machines but do they listen?

SECTION TITLE: Business
IRISH INDEPENDENT
June 14, 2001

JOURNAL CODE: FII LANGUAGE: English RECORD TYPE: FULLTEXT
WORD COUNT: 825

(USE FORMAT 7 OR 9 FOR FULLTEXT)

it, voice command is not all that dramatic.
Far more impressive is the capability of processing speech
directly into editable on-screen text.

Speech to text dictation has been in development by... 



24/3,K/6 (Item 1 from file: 47)

DIALOG(R) File 47: Gale Group Magazine DB(TM)
(c) 2008 The Gale group. All rts. reserv.

02596246 SUPPLIER NUMBER: 00566646 (USE FORMAT 7 OR 9 FOR FULL TEXT)
Project: Data Base: Part 7.

Stallings, S.

PC Magazine, v3, n17, p218-219
Sept. 4, 1984

DOCUMENT TYPE: evaluation ISSN: 0888-8507 LANGUAGE: ENGLISH

RECORD TYPE: FULLTEXT; ABSTRACT

WORD COUNT: 18541 LINE COUNT: 01417

on the screen, which, in turn, is passed to the program generator
to create a program for execution. As always, the resulting program can
be edited and saved.

The VIEW module uses an... can be invoked directly from a menu, or can
be reached after a REQUEST-generated program has been executed or after
an error has occurred in a request or program. SALVO programs are divided



...a bit of practice, but this would be the case with any new high-level
language. The output programs from REQUEST and VIEW modules were
tremendously helpful as learning tools, although some detailed... a
dictionary term record. If you call for that term in a query, you are
executing a program that will then create the data to be displayed on
the report. The program could...



24/3,K/7 (Item 2 from file: 47)

DIALOG(R) File 47: Gale Group Magazine DB(TM)
(c) 2008 The Gale group. All rts. reserv.

02582946 SUPPLIER NUMBER: 03420780 (USE FORMAT 7 OR 9 FOR FULL TEXT)
Project: database, part 7. (evaluation)

Stallings, Stephanie

PC Magazine, v3, p218(44)

Sept 4, 1984

DOCUMENT TYPE: Evaluation LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT

WORD COUNT: 18541 LINE COUNT: 01417

on the screen, which, in turn, is passed to the program generator
to create a program for execution. As always, the resulting program can
be edited and saved.

The VIEW module uses an... can be invoked directly from a menu, or can
be reached after a REQUEST-generated program has been executed or after
an error has occurred in a request or program. SALVO programs are divided



...a bit of practice, but this would be the case with any new high-level
language. The output programs from REQUEST and VIEW modules were
tremendously helpful as learning tools, although some detailed... a
dictionary term record. If you call for that term in a query, you are
executing a program that will then create the data to be displayed on
the report. The program could...



24/3,K/8 (Item 1 from file: 148)

DIALOG(R) File 148: Gale Group Trade & Industry DB
(c)2008 The Gale Group. All rts. reserv.

09964887 SUPPLIER NUMBER: 20140949 (USE FORMAT 7 OR 9 FOR FULL TEXT)
ART Announces smARTcommander for Palm PC

PRNewswire, p110CGSA001
Jan 10, 1998

LANGUAGE: English RECORD TYPE: Fulltext

WORD COUNT: 513 LINE COUNT: 00050

Prompts and helpful browsers simplify programming. Users can easily
add new words to smARTcommander's vocabulary to build their own personal
voice library.

smARTcommander will be available in the Spring of...



24/3,K/9 (Item 2 from file: 148)

DIALOG(R) File 148: Gale Group Trade & Industry DB
(c)2008 The Gale Group. All rts. reserv.

08760026 SUPPLIER NUMBER: 18361016 (USE FORMAT 7 OR 9 FOR FULL TEXT)
At Comdex: from Merlin to Microsoft. (IBM's Merlin operating system
products from Microsoft) (Industry Trend or Event)

Richards, Kathleen

Computer Retail Week, v6, n136, p1(2)
June 3, 1996

ISSN: 1066-7598 LANGUAGE: English RECORD TYPE: Fulltext; Abstract

WORD COUNT: 572 LINE COUNT: 00048

pages using voice commands. Merlin will also feature dictation
capability, with a built-in starter vocabulary of 10,000 words, and will
broaden support to include Java applications and True Type...



24/3,K/10 (Item 1 from file: 275)

DIALOG(R) File 275: Gale Group Computer DB(TM)
(c) 2008 The Gale Group. All rts. reserv.

01692974 SUPPLIER NUMBER: 15518865 (USE FORMAT 7 OR 9 FOR FULL TEXT)
IBM's replacement for the keyboard: voice recognition systems. (IBM
Personal Dictation System) (Hardware Review) (includes related articles
on the ASIC chip handling most of the voice processing algorithms, the
system's Voice Action Editor and a user's view of the system)
(Evaluation)

Boxer, Steve

PC User, n235, p40(4)

May 18, 1994

DOCUMENT TYPE: Evaluation ISSN: 0263-5720 LANGUAGE: ENGLISH

RECORD TYPE: FULLTEXT; ABSTRACT

WORD COUNT: 3516 LINE COUNT: 00261

a macro language; instead, you're given a choice of all possible
voice or keyboard-activated commands in that application (OS/2 2.1 is
very good at this) and you just pick and choose...

...IPDS, the more convinced you become that it surely forms the basis of a
future voice-input operating system.



24/3,K/11 (Item 2 from file: 275)

DIALOG(R) File 275: Gale Group Computer DB(TM)
(c) 2008 The Gale Group. All rts. reserv.

01433820 SUPPLIER NUMBER: 10769161 (USE FORMAT 7 OR 9 FOR FULL TEXT)
Natural language comes of age.

Varhol, Peter D.

Personal Workstation, v3, n5, p22(3)
May, 1991

ISSN: 1047-4013 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT

WORD COUNT: 1694 LINE COUNT: 00134

must define organization-specific words in terms of words that are
in Natural Language's lexicon, or in terms of an SQL query. This enables
the user to phrase queries using... technology than database query? The
answer has to be yes. Any software that uses a command language or a
menuing interface is a candidate for natural language. In fact, any
operating system that does not have an integrated graphical user
interface should consider using natural language.
Mere...

...between computer and user. Once the appropriate system calls are built
into the operating system, applications can start using those calls to
present their own natural language interface. This could not only
revolutionize the...

...software, but it could also provide the basis for the most intuitive
interface of all: voice recognition. The problems of implementing both
technologies are similar in that both attempt to understand the syntax and
semantics of the English language. Voice recognition, complicated by
millions of different accents and inflections, is far more difficult.
However, conquering the...



24/3,K/12 (Item 1 from file: 610)

DIALOG(R) File 610: Business Wire

(c) 2008 Business Wire. All rts. reserv.

00860814 20030303062B2234 (USE FORMAT 7 FOR FULLTEXT)

ScanSoft Ships Dragon NaturallySpeaking 7; Speech Recognition Goes
Mainstream With Most Accurate and Fastest Release Ever - Significant New
Features and Enhancements Result in 50 Percent Improvement in Performance;
Accuracy Rates...

Monday, March 3, 2003 08:02 EST

JOURNAL CODE: BW LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT
DOCUMENT TYPE: NEWSWIRE
WORD COUNT: 1,308

...the writing
style of the user, and the Performance Assistant, which optimizes the
efficiency of speech recognition based on a user's workflow. In total,
the accuracy and performance improvements within Dragon NaturallySpeaking 7
deliver an unmatched level of productivity and ease of use for speech
recognition products.

"Dragon NaturallySpeaking has been a boon to those who need to limit their
typing...

...accuracy and ease-of-use advancements in Dragon NaturallySpeaking 7
represent a step forward in speech recognition technology for users at
home and in the office. Dragon NaturallySpeaking 7 was designed and...
...Intel is
pleased to have worked with ScanSoft, a member of the Intel(R) Early
Access Program, to deliver the performance needed to make speech
recognition a valuable solution for all PC users."

The new release also includes expanded network tools... 

...throughout an enterprise, a
key requirement for large healthcare organizations and corporations looking
to implement speech recognition solutions.

"Many years of leadership in imaging, and more recently speech
technologies, have provided ScanSoft...

...yielded an unprecedented accuracy
improvement of up to 15% over the previous release, resulting in speech
recognition accuracy levels as high as 99 percent. -- Fastest Ever - More
than 50 percent improvement in...
...Dictation
Through Natural Punctuation - While professionals are accustomed to
speaking punctuations, Dragon NaturallySpeaking 7 brings speech
recognition to mainstream users by eliminating the need to say "period"
or "comma" when creating e...

...recorders, to record mobile dictation for automatic transcription when
they synch to their PCs. -- New Vocabulary Optimizer - Users can instruct
Dragon NaturallySpeaking to instantly analyze sentence structure and word
use frequency...

...patterns, thereby further increasing accuracy. -- Most Accessible Ever
- Dragon NaturallySpeaking 7 is the first desktop speech recognition
solution to be certified as compliant with U.S. Section 508, which is
focused on...
...with ScanSoft RealSpeak Proofing, which reads
email and document text aloud in a human-sounding synthesized voice. --
New Performance Assistant - Dragon NaturallySpeaking now optimizes itself
for each user's application usage patterns. -- Accurate Voice Control of
the PC - Dragon NaturallySpeaking has improved voice command and control
capabilities, which allows users to use their voice to interact with the
PC operating system and control application menus. Most applications gain
voice control capabilities by simply installing Dragon NaturallySpeaking
onto the PC.
Expanded Custom Voice Shortcuts - The ability...



24/3,K/13 (Item 2 from file: 610)

DIALOG(R) File 610: Business Wire

(c) 2008 Business Wire. All rts. reserv.

00726422 20020606157B2019 (USE FORMAT 7 FOR FULLTEXT)

ScanSoft Launches Dragon NaturallySpeaking XP Edition in Europe with the
Broadest Language Support Available - New Dragon NaturallySpeaking(R) XP
Edition Supports Automatic Dictation in Five European Languages; Home,
Office and...

Business Wire

Thursday, June 6, 2002 08:02 EDT

JOURNAL CODE: BW LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT
DOCUMENT TYPE: NEWSWIRE
WORD COUNT: 823

ScanSoft Dragon NaturallySpeaking is the market-leading automatic dictation
and speech recognition product for Microsoft(R) Windows(R) and Windows
XP. The highly accurate speech product allows...
...to 160 words-per-minute, saving time and increasing worker productivity.
Dragon NaturallySpeaking also enables voice-control of Microsoft Windows
operating system commands and third-party PC applications, giving users an
efficient way to start programs, save files, surf the Web and create email.
"As a lawyer, I needed a dictation product that...
...create documents and reports."

Dragon NaturallySpeaking achieves its accuracy in part through a built-in
vocabulary of more than 250,000 words, and through its support for adding
custom vocabularies. The...

...These include:

Dragon NaturallySpeaking(R) XP Edition Standard is a highly
accurate, easy-to-use speech recognition product focused on
creating text documents and e-mails for home computer users
and their...

...Dragon NaturallySpeaking(R) XP Edition Preferred is a full
featured, highly accurate, easy-to-use speech recognition
product focused on creating and editing text documents,
reports and e-mails for small office and home office users.
Dragon NaturallySpeaking(R) Professional is ScanSoft's leading
speech recognition product aimed at the professional and
corporate user. DNS Professional is a powerful productivity
tool...

...German and is designed for the special needs of legal
professionals. It includes an extensive vocabulary of legal
terms, phrases and acronyms.

"ScanSoft is breaking new ground with its first branded speech
recognition product line for the European marketplace. By offering Dragon
NaturallySpeaking(R) XP Edition in different...

...include (features may vary by edition):

-- The industry's highest accuracy rate for desktop automatic speech
recognition
-- Converting voice into text at up to 160 words-per-minute
-- Hands-free, voice-enabled control of Microsoft Windows-based
applications
-- Ability to expand and customize the vocabulary
-- Option to have e-mails and other documents read aloud using the
award-winning RealSpeak...



24/3,K/14 (Item 1 from file: 624)

DIALOG(R) File 624: McGraw-Hill Publications
(c) 2008 McGraw-Hill Co. Inc. All rts. reserv.

00890759

'COMPUTER, SAVE THIS FILE': A software program lets you run your PC by
talking to it in everyday English

BY STEPHEN H. WILDSTROM
Business Week, Number 3551, Pg 22
November 3, 1997
JOURNAL CODE: BW

SECTION HEADING: Technology & You ISSN: 0007-7135
WORD COUNT: 578

TEXT:

...000 or more for an effective dictation system. The limited vocabulary
also means you can start to use the program without the tedious speech
training required by dictation systems.

VoiceCommands can handle all of the...

...bug.

COMBO. Of course, what we really need is a system that combines dictation
with voice control--and works with all the programs on your computer.
Making such software function both smoothly and consistently requires that
the operating system itself support speech recognition. Apple
Computer Mac OS included a version of speech command-and-control called
PlainTalk for...



24/3,K/15 (Item 1 from file: 636)

DIALOG(R) File 636: Gale Group Newsletter DB(TM)
(c) 2008 The Gale Group. All rts. reserv.

01494985 Supplier Number: 42094803 (USE FORMAT 7 FOR FULLTEXT)
Fifth-generation computer project in its final year

NTIS Foreign Technology Newsletter, v91, n22, pN/A
May 22, 1991
Language: English Record Type: Fulltext
Document Type: Magazine/Journal; Trade
Word Count: 2179

Neumann language program. In addition, ICOT has developed the
world's only full-scale logic language operating systems - SIMPOS,
for its sequential machines, and PIMOS, for its parallel ones. Finally, the
latest versions...

...processors, and then to reintegrate those parts back into the program.
ICOT's non-numerical programs do not run efficiently on such machines.
On the other hand, since the Cray and other parallel computers...

...in their operation, nor do they translate between languages. Efforts
have not been made toward voice recognition because, according to ICOT
officials, such a computer would be a "special" computer and ICOT...
...VLSI CAD applications, theoretical math systems, a variety of expert
decision systems, legal reasoning and grammar/syntax programs, genetic
sequencing and other applications in biology, and go-playing programs
(similar to...



24/3,K/16 (Item 1 from file: 647)

DIALOG(R) File 647: CMP Computer Fulltext
(c) 2008 CMP Media, LLC. All rts. reserv.

01092989 CMP ACCESSION NUMBER: CRW19960603S0004
At Comdex: From Merlin to Microsoft

Kathleen Richards

COMPUTER RETAIL WEEK, 1996, n 636, PG01

PUBLICATION DATE: 960603

JOURNAL CODE: CRW LANGUAGE: English

RECORD TYPE: Fulltext

SECTION HEADING: News

WORD COUNT: 525

beta this month and ship in the second half of 1996, Merlin is
the first operating system to offer integrated speech recognition
that will allow users to launch files and even navigate Internet Web
pages using voice commands. Merlin will also feature dictation
capability, with a built-in starter vocabulary of 10,000 words, and
will broaden support to include Java applications and True Type...



29/3,K/1 (Item 1 from file: 9)

DIALOG(R) File 9: Business & Industry(R)
(c) 2008 The Gale Group. All rts. reserv.

01424341 Supplier Number: 24072084 (USE FORMAT 7 OR 9 FOR FULLTEXT)
Call Centers: Customizing the Customer Approach

(Call centers are increasingly becoming a more central, integrating hub for
linking customers and in-house work processes; by 1998, average spending
on call center technology by banks with over $1 bil in deposits will
reach the $5 mil to $6 mil level)

Financial Service ONLINE, p S27-S30
November 1997

DOCUMENT TYPE: Journal; Industry Overview (United States)
LANGUAGE: English RECORD TYPE: Fulltext
WORD COUNT: 2093

(USE FORMAT 7 OR 9 FOR FULLTEXT)

TEXT:

...marketing for Syntellect of Roswell, GA, important areas of development
include call center intelligence, "virtual access" applications, and
large-vocabulary, speaker-independent speech recognition.

Putting intelligence within the call center is essential...

...application is compatible with ActiveX, Microsoft's OLE and Dynamic
Document Exchange (DDE).

A virtual access application is one that gives customers the ability to
access the same bank services using their...

...and by telephone.

Another thing that is new for call centers, says Reece, is
large-vocabulary, speaker-independent speech recognition. "Most of the
speech recognition that has been deployed up to this point has been
pretty simplistic. It can understand...

...can do that. It's deliverable."

The VocalPoint SpeechReco Server has a 20,000 word vocabulary per
application, and it delivers voice recognition capabilities to IVRs on
an as-needed basis. It's phonetics based, says Reece, and...

...them in the system. Another important feature is grammar spotting, which
can pick out noise words and link key words into phrases to structure
a command.
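
The grammar spotting idea in the excerpt above (discard noise words, then link the remaining key words into a phrase that names a command) can be illustrated with a minimal Python sketch. Everything in it is hypothetical: the noise-word list, the key phrases, the command names and the function `spot_command` are assumptions made for illustration and are not taken from the article or from any Syntellect product.

```python
from typing import Optional

# Hypothetical filler words that carry no command meaning.
NOISE_WORDS = {"um", "uh", "please", "the", "my", "i", "would", "like", "to"}

# Hypothetical key-word pairs and the command each pair stands for.
KEY_PHRASES = {
    ("transfer", "funds"): "TRANSFER_FUNDS",
    ("account", "balance"): "ACCOUNT_BALANCE",
}


def spot_command(utterance: str) -> Optional[str]:
    """Drop noise words, then look for adjacent key words that form a command."""
    words = [w for w in utterance.lower().split() if w not in NOISE_WORDS]
    for first, second in zip(words, words[1:]):
        command = KEY_PHRASES.get((first, second))
        if command is not None:
            return command
    return None  # no known key phrase was found
```

Because the noise words are removed before matching, an utterance such as "um I would like to transfer the funds please" still yields the pair ("transfer", "funds").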

Reece says that SpeechReco has unlimited possibilities for applications,
all of which need to be custom designed -- by Syntellect or by the
financial institution...

29/3,K/2 (Item 1 from file: 15)

DIALOG(R) File 15: ABI/Inform(R)

(c) 2008 ProQuest Info&Learning. All rts. reserv.

02154653 71749993

A virtual investigation of global contact center technologies

Tehrani, Rich

Customer Inter@ction Solutions v19n10 PP: 10-17 Apr 2001
JRNL CODE: TLM
WORD COUNT: 4224

...TEXT: the speech recognition business since 1972! 

SpeechPearl 2000. It's amazing what 30 years of speech recognition has 
yielded. SpeechPearl 2000 is the latest version of the Philip's core engine
for host-based, large vocabulary natural language recognition, which
allows users to develop applications with true grammars. This allows end
users to interact with the speech recognition system as if it were a
live person.

In practical terms, this allows a customer...

...to my savings account number 12345." Employees of Philips tell me their
use of open grammar technology allows users to develop applications more
quickly, as speech recognition and natural language understanding can
actually be done in two different steps. This is in contrast to closed
grammars in which each utterance must be previously matched.

This type of flexibility allows users to organize grammars for things such
as location, useful in an application such as directory assistance, or
even the cost and type of food for a restaurant...

...you to design full call flow. The system then outputs the equivalent
code needed to run your application. This component features something
called mixed initiative dialog, which allows the system to determine which



29/3,K/3 (Item 2 from file: 15)

DIALOG(R) File 15: ABI/Inform(R)

(c) 2008 ProQuest Info&Learning. All rts. reserv.

01541333 01-92321

Customizing the customer approach

Anonymous

US Banker v107n11 PP: S27-S30 Nov 1997
ISSN: 0148-8848 JRNL CODE: USI
WORD COUNT: 1936

...TEXT: marketing for Syntellect of Roswell, GA, important areas of
development include call center intelligence, "virtual access"
applications, and large-vocabulary, speaker-independent speech
recognition.

Putting intelligence within the call center is essential...

...application is compatible with ActiveX, Microsoft's OLE and Dynamic
Document Exchange (DDE).

A virtual access application is one that gives customers the ability to
access the same bank services using their...

...and by telephone.

Another thing that is new for call centers, says Reece, is
large-vocabulary, speaker-independent speech recognition. "Most of the
speech recognition that has been deployed up to this point has been
pretty simplistic. It can understand... can do that. It's deliverable."

The VocalPoint SpeechReco Server has a 20,000 word vocabulary per
application, and it delivers voice recognition capabilities to IVRs on
an as-needed basis. It's phonetics based, says Reece, and...

...them in the system. Another important feature is grammar spotting, which
can pick out noise words and link key words into phrases to structure
a command.

Reece says that SpeechReco has unlimited possibilities for applications,
all of which need to be custom designed - by Syntellect or by the financial
institution...

Speech recognition: From the lab to the real world

Cameron, Hugh; Heffner, Ken; Pancholy, Rajiv
Telesis n101 PP: 28-31+ Sep 1996
ISSN: 0040-2710 JRNL CODE: TLS
WORD COUNT: 4478

...TEXT: Until recently, the cost and effort to develop fixed vocabularies
limited wide-scale use of speech recognition. The investment could be
justified only for applications like AABS, in which a small fixed
vocabulary - that is, "yes" and "no" responses - needs to be recognized,
and where the potential call volume is large. For the vast majority of
voice-activated telecommunications applications to become practical,
the creation of new vocabularies would have to be divorced from the...

...that can be pronounced in different ways, likely alternatives can be
stored in the target vocabulary set.

FVR was one of the critical elements in success of Nortel's StockTalk, the
world's first large-vocabulary, dial-up speech-recognition service.
Introduced as an internal trial service in 1992 and now handling up to 4...
...that more naturally mimics human conversations.

Rejection enables the system to decide whether a given speech input is
not a good match for any word modeled in the target vocabulary. This
capability is particularly useful in applications where the target
vocabulary is not intuitively obvious to users, or where callers are not
expecting their requests to...

...example, the recognizer mistakes an out-of-vocabulary input for a word
in the target vocabulary); and false rejection (where a caller's valid
in-vocabulary request is deemed to be out-of-vocabulary).
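
The rejection decision and the two error types named in the excerpt above can be sketched in Python as follows. This is only an illustration under stated assumptions: the per-word match scores, the 0.6 threshold, and the names `Decision`, `recognize_with_rejection` and `classify_outcome` are hypothetical and do not describe how Nortel's FVR technology actually scores or rejects input.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Set


@dataclass
class Decision:
    word: Optional[str]  # None means the input was rejected
    score: float


def recognize_with_rejection(scores: Dict[str, float],
                             threshold: float = 0.6) -> Decision:
    """Accept the best-matching vocabulary word, or reject if no match is good enough.

    `scores` maps each target-vocabulary word to an assumed match score in [0, 1].
    """
    if not scores:
        return Decision(None, 0.0)
    best_word = max(scores, key=lambda w: scores[w])
    best_score = scores[best_word]
    if best_score < threshold:
        return Decision(None, best_score)  # not a good match for any word
    return Decision(best_word, best_score)


def classify_outcome(decision: Decision, spoken: str, vocabulary: Set[str]) -> str:
    """Label a decision given what the caller actually said."""
    in_vocabulary = spoken in vocabulary
    if decision.word is None:
        return "false rejection" if in_vocabulary else "correct rejection"
    if not in_vocabulary:
        return "false acceptance"  # out-of-vocabulary input taken for a word
    return "correct acceptance" if decision.word == spoken else "substitution"
```

Raising the threshold in this sketch trades false acceptances for false rejections, which is why the two error types are usually reported together.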

To ensure the robustness of its speech - recognition technology under the 
challenges posed by the t el ecorrrruni cat i ons environment, Nortel engineered 
FVR pr epr ocessi ng t echni ques to... 

...solutions, Nortel is evolving St ockTal k and other speech applications in 
a new direct ion with continuous- speech recognition and i nt ui t i ve- di al og 
recognition (see page 35). 

Unlike FVR technology where recognition is limited to individual words or
short phrases, continuous-speech recognition will enable a system to
recognize multiple vocabulary items within a single speech input,
such as those found in complete sentences or sentence fragments. To achieve
this capability, the... Nortel's recently developed connected digit
recognition (CDR) technology. CDR will be employed in voice-activated
dialing (VAD), an application that allows users to quickly trigger calls
by speaking the string of digits for a...
Voice recognition - An opportunity being missed?

Tyler, Geoff

Management Services v39n11 PP: 30-32 Nov 1995
ISSN: 0307-6768 JRNL CODE: MMS
WORD COUNT: 2402

...TEXT: solicitors, Masons. He is among the people who have found
complementary in-office and mobile applications and starts with a word
of caution.

The principle of speaking to one's computer instead of . . . so one's existing 
hardware could be called into play. It also has a full applications
programming interface for linking, for instance, to proprietary word
processing systems.

There is one further development which boggles the mind somewhat. Once the
voice...

...which way as such systems allow us to do. It can also be used as input
to a language translation system.

A lot of regular travellers now use pocket sized computers containing
foreign language electronic dictionaries which translate single words or 
short passages of text typed on their keyboards and display... 

...developed in its own laboratories in London a system with a 250,000
word vocabulary and the ability to recognise continuous speech at 60
to 70 words a minute. It will deliver the translated result in screened...
Hot products: Dragon Systems DragonDictate for Windows 2.0 speech
recognition and dictation software

Nie, Norman

Computer Reseller News n661 PP: 194 Dec 4, 1995
ISSN: 0893-8377 JRNL CODE: CRN
WORD COUNT: 580

...TEXT: different price points, differentiated mainly by the number of
words that each contains in its dictionary.

Description

The speech-recognition software enables control of Windows using voice
commands and dictation into nearly any application, including Word, Excel
and WordPerfect. The Classic Edition includes a 30,000-word active
vocabulary list and a 100,000+ word backup dictionary. DragonDictate is
available in British English, German...

...to run after installation and setup, the engineer found running the 
tutorial and quick training programs necessary before getting started.
Distinct speech (spoken speech where there is a pause after every word) is
necessary for...

...Center engineers were impressed with DragonDictate's spoken-word
recognition speed. Also impressive was the program's automatic checklist
feature. When it hears a word it does not recognize, it prompts from a list
of known words for selection of a possible match or the addition of a
newly spoken one. As words are accepted, DragonDictate "learns" and...
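
The checklist behaviour described in this excerpt (offer likely matches from the known-word list, or add the new word) might be sketched as follows. This is an illustration only, not DragonDictate's method: spelling similarity from Python's `difflib` stands in for acoustic similarity, and the function names and the 0.6 cutoff are assumptions.

```python
import difflib
from typing import List, Set


def suggest(heard: str, known_words: Set[str], limit: int = 3) -> List[str]:
    """Return up to `limit` known words that resemble the unrecognized input."""
    # Spelling similarity is a stand-in for acoustic matching (an assumption).
    return difflib.get_close_matches(heard, sorted(known_words), n=limit, cutoff=0.6)


def resolve(heard: str, known_words: Set[str], choice: str = "") -> str:
    """Accept the user's pick from the checklist, or add the new word."""
    if heard in known_words:
        return heard
    if choice and choice in suggest(heard, known_words):
        return choice
    known_words.add(heard)  # the vocabulary grows as new words are accepted
    return heard
```

For example, with a vocabulary of `{"accept", "except", "excel"}`, the input `"exept"` would produce a checklist that includes `"except"`, while an input with no close match would simply be added to the vocabulary.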
West breaks new ground with LawTalk: Voice recognition for WESTLAW

Griffith, Gary

Information Today v11n3 PP: 10-11 Mar 1994
ISSN: 8755-6286 JRNL CODE: IFT
WORD COUNT: 1320

...TEXT: though its name should be Jack or Jill instead of a monitor with a
microphone. "Open files." the person says. And by the latest miracle of
technology that's exactly what it...

...than simply turning it on and talking to it.

LawTalk is based on DragonDictate, a voice recognition engine produced
by Dragon Systems, Inc. from Newton, Massachusetts. DragonDictate contains
an extensive vocabulary (30,000 words), with an additional 110,000 words
in a backup list. According to...
00622631 92-37733
Voice Processing Systems

Helms, Glenn L.

Internal Auditor v49n3 PP: 62-68 Jun 1992
ISSN: 0020-5745 JRNL CODE: IAU
WORD COUNT: 3062

...TEXT: keying in a password. Any individual who has knowledge of the 
password can gain unauthorized access to files. In order to access a
voice system the user is required to speak a password. A voiceprint is
created...

...In such cases, developers should consider acquiring directional 
microphones which filter out irrelevant background sounds.

Speech recognition systems also have behavioral implications. Users who
train a speaker-independent system typically talk in a more formal voice
when they construct a vocabulary. When the system becomes operational and
the user speaks in his/her natural voice with... 

...recorded standardized voice responses are not as easy to edit as typed 
text. Sometimes the vocabulary is recorded by several individuals, which
may also make the voice output systems difficult to understand. In
order to avoid user dissatisfaction, voice output systems should use a
pleasant human voice recorded by one individual. 

Many voice output systems... data processing applications. The speaker
verification option can ensure that only authorized users are granted
access to application and data. The speaker verification process can be
set at a low tolerance so that...
Computer Reseller News, p184
Dec 4, 1995

Language: English Record Type: Full text
Document Type: Magazine/Journal; Trade
Word Count: 3870

different price points, differentiated mainly by the number of
words that each contains in its dictionary.
Description

The speech-recognition software enables control of Windows using
voice commands and dictation into nearly any application, including Word,
Excel and WordPerfect. The Classic Edition includes a 30,000-word active
vocabulary list and a 100,000+ word backup dictionary. DragonDictate is
available in British English, German...

...to run after installation and setup, the engineer found running the 
tutorial and quick training programs necessary before getting started.
Distinct speech (spoken speech where there is a pause after every word) is
necessary for...
...Test Center engineers were impressed with DragonDictate's spoken-word
recognition speed. Also impressive was the program's automatic checklist
feature. When it hears a word it does not recognize, it prompts from a list
of known words for selection of a possible match or the addition of a
newly spoken one. As words are accepted, DragonDictate "learns" and...
CANADA: SPEECH RECOGNITION SOFTWARE MARKET (1)

VIKTORIA PALFI, HAMID SHIRAZI
INDUSTRY SECTOR ANALYSIS
May 25, 1999

JOURNAL CODE: FISA LANGUAGE: English RECORD TYPE: FULLTEXT
WORD COUNT: 4786

(USE FORMAT 7 OR 9 FOR FULLTEXT)

in the growth of speech technology. She said that, "handheld 
devices that incorporate cell-phone links, PDA applications,
handwriting entry, and infrared connectivity will be physical necessities
for communication in the next millennium and will...
Lip service. (voice recognition software) (Evaluation)

Komando, Kim

Popular Mechanics, v175, n6, p100(4)
June, 1998

DOCUMENT TYPE: Evaluation ISSN: 0032-4558 LANGUAGE: English

RECORD TYPE: Fulltext; Abstract

WORD COUNT: 2062 LINE COUNT: 00158

another" also was funny, but I'll spare dear Mditi ) During one of my
test runs, both programs recorded "a 2 and a 6" as "a 286" -- interesting
in that most 286s that...

...around 9 are indeed of pale metal.

I also found it noteworthy that although both programs claim to 
match context for like-sounding words, ViaVoice used "accept" instead of
"except" in the last sentence of the first paragraph. I...

...And with Microsoft Word, they're flagged as I type them.)

On the other hand, speech recognition software can only type 
words that are in its dictionary. That means that when you use this
software, each word is spelled correctly--even if...

...voice commands to operate your system. For example, issuing the command
"open" automatically selects the Open command from the File menu. This
isn't quite as easy as it seems. For example, suppose you're... 
When You Talk, Your PC Listens.

Schoen, J.

PC Magazine, v4, n5, p122-132
March 5, 1985

DOCUMENT TYPE: evaluation ISSN: 0888-8507 LANGUAGE: ENGLISH

RECORD TYPE: FULLTEXT; ABSTRACT

WORD COUNT: 5773 LINE COUNT: 00447

input system at a reasonable price, the AudoPilot 1000 is the best
I reviewed.

TECMAR VOICE RECOGNITION BOARD

The Tecmar PC-Mate Voice Recognition Board falls between the
VocaLink and AudoPilot in price ($995 plus optional microphone),
vocabulary capacity, versatility, and ease of use. The manual was simple
to follow and the software...

...installation problems, this board stands as a good example of the solid
potential of reliable voice input in various application programs. 

Tecmar's board, like the VocaLink, uses its own memory to store the
vocabulary and speech templates, and so it does not require system memory.
Though only 8K of...

...a vocabulary of 100 words. With a 16K option, a 200-word vocabulary is
available. (Programs that do not run under PC-DOS, however, are
apparently not available to the board. The board recognized my...

...additions are fairly simple.

The routine also stores files (you give a filename when you start 
the program) and makes a backup copy automatically. Once you enter and 
store commands, you train them...

...the commands with additional repetitions. One helpful feature of the
training mode is that the program produces two test results as you speak
each word, showing how well the spoken word matches the voice print
stored so far. Once trained, the template is stored in a file that you
name separately.

The board's utility program, like the VocaLink, also has a...
When you talk, your PC listens. (evaluation)

Schoen, John

PC Magazine, v4, p122(11)
March 5, 1985

input system at a reasonable price, the AudoPilot 1000 is the best
I reviewed.

TECMAR VOICE RECOGNITION BOARD

The Tecmar PC-Mate Voice Recognition Board falls between the
VocaLink and AudoPilot in price ($995 plus optional microphone),
vocabulary capacity, versatility, and ease of use. The manual was simple
to follow and the software...

...installation problems, this board stands as a good example of the solid
potential of reliable voice input in various application programs. 

Tecmar's board, like the VocaLink, uses its own memory to store the
vocabulary and speech templates, and so it does not require system memory.
Though only 8K of...
...a vocabulary of 100 words. With a 16K option, a 200-word vocabulary is
available. (Programs that do not run under PC-DOS, however, are
apparently not available to the board. The board recognized my...

...additions are fairly simple.

The routine also stores files (you give a filename when you start 
the program) and makes a backup copy automatically. Once you enter and
store commands, you train them...

...the commands with additional repetitions. One helpful feature of the
training mode is that the program produces two test results as you speak
each word, showing how well the spoken word matches the voice print
stored so far. Once trained, the template is stored in a file that you
name separately.

The board's utility program, like the VocaLink, also has a...
A Virtual Investigation a Global Contact Center Technologies. (Company
Business and Marketing)

Tehrani, Rich

Customer Interaction Solutions, 19, 10, 10
April, 2001

LANGUAGE: English RECORD TYPE: Full text

WORD COUNT: 4455 LINE COUNT: 00347

you to design full call flow. The system then outputs the
equivalent code needed to run your application. This component features
something called mixed initiative dialog, which allows the system to
determine which...
Adler, Allan

Computer Reseller News, n661, p174(1)
Dec 4 1995

DOCUMENT TYPE: Column ISSN: 0893-8377 LANGUAGE: English

RECORD TYPE: Fulltext; Abstract
Test Center engineers were impressed with DragonDictate's
spoken-word recognition speed. Also impressive was the program's
automatic checklist feature. When it hears a word it does not recognize, it
prompts from a list of known words for selection of a possible match or
the addition of a newly spoken one. As words are accepted, DragonDictate
"learns" and...
Voice training. (Microsoft Corp.'s Windows Sound System) (Product
Announcement)

Desmarais, Norman
CD-ROM World, v9, n2, p64(3)
Feb, 1994

DOCUMENT TYPE: Product Announcement ISSN: 1066-274X LANGUAGE:
ENGLISH RECORD TYPE: FULLTEXT

WORD COUNT: 1405 LINE COUNT: 00112

voice model for use on different computers.
While many of the same commands such as File, Open, Page Up, and
Page Down apply to all Windows programs, Microsoft's applications do not...

...System owners need not restrict themselves to Voice Pilot commands,
however. They can use the program to activate keystroke combinations
just like any macro -- say, a keystroke that adds an illustration of a...
have 20 entries and can contain information from 20 input
sentences. Next, the train routine opens the dictionary file named
diction. If the dictionary opened successfully, the main control flow of
the program is...

...Several processes occur within the while statement for each sentence. 
From a broad view, the program extracts the sentence words and matches 
them with the dictionary, which contains information about the word. The
program loads this word information into arrays that it analyzes to
determine the underlying structure of...run, runs, ran, and running, the
word run is the root. Each verb in the dictionary has a root. The root
will later identify a group of similar words that may be used in a
generated response sentence. An expanded natural language processor
that generates many different responses would find the root invaluable. For
example, given the input...
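
The word-matching and root lookup described in this excerpt can be sketched in a few lines of Python. This is an assumption-laden illustration, not the article's program: the in-memory `DICTIONARY` table stands in for the dictionary file, and its fields and the helper names are hypothetical.

```python
import string
from typing import Dict, List, Optional

# Hypothetical dictionary entries: each word maps to a part of speech and a root.
DICTIONARY: Dict[str, Dict[str, str]] = {
    "run": {"pos": "verb", "root": "run"},
    "runs": {"pos": "verb", "root": "run"},
    "ran": {"pos": "verb", "root": "run"},
    "running": {"pos": "verb", "root": "run"},
    "dog": {"pos": "noun", "root": "dog"},
}


def extract_words(sentence: str) -> List[str]:
    """Lower-case the sentence, strip punctuation, and split it into words."""
    cleaned = sentence.lower().translate(str.maketrans("", "", string.punctuation))
    return cleaned.split()


def lookup(sentence: str) -> List[Optional[Dict[str, str]]]:
    """Match each sentence word with its dictionary entry (None if unknown)."""
    return [DICTIONARY.get(word) for word in extract_words(sentence)]


def words_with_root(root: str) -> List[str]:
    """Return the group of dictionary words that share the given root."""
    return sorted(w for w, info in DICTIONARY.items() if info["root"] == root)
```

A response generator could then use `words_with_root` to pick an alternative form of a verb found in the input sentence.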
March 8, 1994

Language: English Record Type: Full text
Document Type: Magazine/Journal; Trade
exceeds 98 percent, ignoring background noise as well as unfamiliar
words, allowing the user to input voice commands as needed without the
fear of substitution errors. 

VoiceMouse has unlimited vocabulary size, and the interface monitors
current active application and adjusts the vocabulary to match that
application word-to-word or macro-to-macro. A recognized voice
command either can launch another application or invoke a keyboard
macro. The software operates with all popular 8 and 16-bit...
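
One way to picture a vocabulary that follows the active application is the Python sketch below. It is only an illustration under stated assumptions: the command tables, the action tuples, and the function names are hypothetical and are not taken from VoiceMouse.

```python
from typing import Dict, Optional, Tuple

# An action is ("launch", program name) or ("macro", keystroke sequence).
Action = Tuple[str, str]

# Hypothetical commands that are active regardless of the current application.
GLOBAL_COMMANDS: Dict[str, Action] = {
    "start spreadsheet": ("launch", "spreadsheet"),
}

# Hypothetical per-application commands; the same phrase maps to different macros.
APP_COMMANDS: Dict[str, Dict[str, Action]] = {
    "spreadsheet": {"save": ("macro", "ALT+F, S")},
    "editor": {"save": ("macro", "CTRL+S")},
}


def active_vocabulary(active_app: str) -> Dict[str, Action]:
    """Build the vocabulary for whichever application currently has focus."""
    vocabulary = dict(GLOBAL_COMMANDS)
    vocabulary.update(APP_COMMANDS.get(active_app, {}))
    return vocabulary


def dispatch(active_app: str, phrase: str) -> Optional[Action]:
    """Map a recognized phrase to an action, or None if it is not in the vocabulary."""
    return active_vocabulary(active_app).get(phrase.lower().strip())
```

Restricting the active vocabulary this way keeps the candidate set small for each recognition attempt, which is one plausible reason to tie the vocabulary to the current application.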

...between words. Verbex said users can skip through Windows layers with a
single spoken phrase, start a program by saying its name or enter data
in a spreadsheet.

The system learns an individual...
different price points, differentiated mainly by the number of
words that each contains in its dictionary.
Description

The speech-recognition software enables control of Windows using
voice commands and dictation into nearly any application, including Word,
Excel and WordPerfect. The Classic Edition includes a 30,000-word active
vocabulary list and a 100,000+ word backup dictionary. DragonDictate is
available in British English, German...
...to run after installation and
setup, the engineer found running the tutorial and quick training
programs necessary before getting started. Distinct speech (spoken
speech where there is a pause after every word) is necessary for...
...Test Center engineers were impressed with DragonDictate's spoken-word
recognition speed. Also impressive was the program's automatic checklist
feature. When it hears a word it does not recognize, it prompts from a
list of known words for selection of a possible match or the addition
of a newly spoken one. As words are accepted, DragonDictate "learns" and



